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READING ABILITIES OF BRIGHT AND DULL 
CHILDREN OF COMPARABLE MENTAL 
AGES’ 


EMERY P. BLIESMER 


University of Texas 


In a recent study by Kolstoe (2), the usefulness of mental age 
as a unit of measurement was investigated by comparing the 
mental abilities of bright and dull children of similar mental ages. 
In general, his results revealed only minor differences in the traits 
measured. The present investigation is a companion study which 
involved essentially the same samples of bright and dull children 
as those employed by Kolstoe. Its purpose was to determine the 
extent to which children of equal mental age but markedly differ- 
ent in chronological age and IQ tend to be alike with respect to 
achievement in reading. This was done by comparing bright and 
dull children with approximately equal estimated true mental 
ages with respect to each of several abilities involved in reading 


‘comprehension. The criterion for ‘dull’ was an estimated true 


Stanford-Binet IQ of 84 or below; the one for ‘bright’ was an 
estimated true Stanford-Binet IQ of 116 or above. Thus the 
minimum separation between the two groups was approximately 
two standard deviation units along the IQ continuum. Estimates 
of the true IQ’s and MA’s were obtained by applying the standard 
formula for correction for regression to the obtained IQ’s and 
MA’s, with the reliability coefficients used in the formula being 
those reported by Terman and Merrill for various IQ levels.’ 
Results of the extensive investigations of Lewis (3) and of 





' This article is based on the writer’s Ph.D. dissertation. The work was 
done at the State University of Iowa under the co-direction of Professors 
A. N. Hieronymus and J. B. Stroud. 

*L. M. Terman and M. A. Merrill, Measuring Intelligence, Boston: 
Houghton Mifflin Co., 1937, p. 46. 
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McGehee (4), earlier studies by Almack and Almack (1) and by 
Van Wagenen (8), and a later study by Thomas (6), suggest that, 
in general, bright children achieve below, and dull children achieve 
above, levels consistent with their indicated mental ages. One 
might expect, therefore, that a direct comparison of the achieve- 
ment of bright and dull children with approximately the same 
mental ~ges would show results favoring the latter. The fact 
that the older, low IQ children have been in school longer and 
have been exposed to instruction in certain specific reading skills 
not taught in the lower grades would also lend support to such an 
hypothesis. On the other hand, most of the investigations of the 
relationship between brightness and over- and under-achievement 
have failed to take into consideration regression effects resulting 
from lack of reliability in the mental age measures, which would 
account for some of the observed differences. Ceiling and floor 
effects, and procedures used in extrapolating age and grade scales 
on standard reading tests may also be responsible for finding very 
little over-achievement among high IQ groups and under-achieve- 
ment among groups of low IQ pupils. Thus, assuming equal 
variability of reading age scores and mental age scores, it is virtu- 
ally impossible for a pupil at or near the 99th percentile on an 
intelligence test to ‘over-achieve’ or a pupil near the first percentile 
to ‘under-achieve’. 

In a study by Ramaseshan (4), ninth-grade pupils of similar 
mental ages (Chicago Tests of Primary Mental Abilities) were 
grouped according to chronological age. Children with similar 
mental ages but different chronological ages (and, consequently, 
differing IQ’s) were compared on the subtests of the Jowa Tests of 
Educational Development. Her bright group was found to excel 
on all of the subtests of the ITED, but differences were statistically 
significant for only three of these subtests. Of some pertinence 
to the purposes of the writer’s study is the fact that no significant 
differences were found on the three subtests measuring the ability 
to read and interpret material in specific content areas. 

Unsicker (7) equated groups according to mental age by match- 
ing groups of bright third- and fourth-grade pupils with groups 
of dull seventh- and eighth-grade pupils, first on the basis of 
Kuhlman-Anderson Intelligence Test scores, then on the basis of 
California Tests of Mental Maturity results. When the groups 
matched on the basis of Kuhlman-Anderson MA’s were compared 
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with respect to reading comprehension scores earned on the Jowa 
Tests of Basic Skills, Unsicker found significant differences favor- 
ing the bright group. Differences between the groups matched on 
the basis of California MA’s were not significant. 

In addition to the fact that different criterion tests were em- 
ployed, the present study differs from most of those in which the 
problem has been investigated previously in that estimated true 
scores were used in the matching procedure, individually admin- 
istered intelligence tests rather than group tests were employed, 
and the criterion tests were of an appropriate level of difficulty so 
that ceiling and floor effects did not operate to bias the means. 


PROCEDURE 


Method of Sampling.—In order to measure a wide range of 
comprehension abilities, it was decided to employ test materials 
which are appropriate at the fourth- to fifth-grade level of reading 
ability. Considering the IQ criteria for ‘bright’ and ‘dull’ (lower 
and upper IQ limits of 116 and 84, respectively), and studying the 
overlap of MA’s at various CA’s for these bright and dull criteria, 
it was decided to use a mental age range of from ten years, seven 
months, through twelve years, six months, and to restrict the 
bright group to children with CA’s of ten years or less and the dull 
group to children with CA’s of fourteen years or above. 

Children in the dull group were selected from regular eighth- 
and ninth-grade classes and some special education classes in two 


._ junior high schools, and children in the bright group from regular 


third- and fourth-grade classes in three elementary schools, in the 
public school system of a large Iowa city. For identification of 
pupils likely to meet sample specifications, the cumulative record 
folder of each child in the two junior high schools, and in third 
and fourth grades in the three elementary schools, was studied. 
A list was made of all the junior high pupils’ who were fourteen 
years of age or older and for whom IQ’s of 90 or below had been 
obtained with group intelligence tests which had been administered 
by the schools in previous years. In the elementary schools, a 
list was made of all third- and fourth-grade pupils who were ten 
years of age or younger and for whom IQ’s of 110 or above had 
been obtained with group intelligence tests in previous years. 
From the list of pupils indicated as likely to meet specifications 
for the dull group, pupils were selected randomly and the Revised 
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TaBLeE I.—CHARACTERISTICS OF THE SAMPLES 

















Groups 

Bright Dull 
Range of CA’s 8-7 through 9-10 14-2 through 16-3 
Mean CA 9-2.5 15-5.4 
Range of MA’s (est. true) 10-8 through 12-6 | 10-8 through 12-6 
Mean MA 11-3.2 11-3.0 
Range of IQ’s (est. true) 116 through 138 72 through 84 
Mean IQ 126.5 79.5 





Stanford-Binet Scale, Form L, was administered until there were 
obtained twenty-nine pupils who met the following specifications: 

1) Estimated true IQ’s of 84 or below. 

2) Estimated true MA’s of from ten years, seven months, 
through twelve years, six months. 

From the list of third- and fourth-grade pupils indicated as 
likely to meet sample specifications for the bright group, pupils 
were selected randomly and the Revised Stanford-Binet Scale, 
Form L, was administered until there was obtained a sample of 
pupils meeting the following specifications: 

1) Estimated true I1Q’s of 116 or above. 

2) Estimated true MA’s of from ten years, seven months, 
through twelve years, six months. 

3) As many bright children in each of four six-months mental 
age intervals or levels (which constituted the two-year range 
indicated in the preceding specification) as there were dull children 
in that interval. 

In the process of obtaining enough bright cases in each level to 
match the number of dull cases in that level, extra cases were 
obtained for some of the levels. For purposes of statistical analy- 
sis, extra cases in each level were later discarded randomly. The 
final sample contained twenty-eight children in each group, dull 
and bright, one case originally selected for the dull group having 
been lost because of incomplete data. In the dull group, there 
were fifteen boys and thirteen girls; thirteen in Grade 8, six in 
Grade 9, and nine in special classes. The bright group was com- 
posed of sixteen boys and twelve girls; fifteen in Grade 3 and 
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thirteen in Grade 4. A summary of information relative to the 
dull and bright groups in the sample is presented in Table I. 

Selection of Comprehension Abilities Investigated.—A survey of 
the professional literature, teachers’ manuals accompanying series 
of readers, standardized tests, and reported results of factorial 
studies was made for suggestions of specific abilities involved in 
reading comprehension. Consideration was also given to availa- 
bility of measuring instruments for these specific abilities, the 
possibilities of adapting available instruments to the purposes of 
this study, and the purported importance of given abilities. This 
resulted in the selection of the following abilities for inclusion in 
this investigation: 

1) Word Recognition (the ability to recognize given words 
‘on sight’). 

2) Word Meaning (the ability to understand or recognize the 
particular meaning of a word as it is used in context). 

3) Memory for Factual Details (the ability to recall specific 
facts which have been definitely stated in a selection). 

4) Location or Recognition of Factual Details (the ability to 
locate or recognize specific factual details which are explicitly 
stated in a given selection). 

5) Perception of Relationships among Definitely Stated Ideas 
(the ability to recognize or to formulate an idea which is not 
explicitly stated in a selection but which is contained in the selec- 
tion when two or more definitely stated ideas are considered to- 


- gether). 


6) Recognition of Main Ideas (the ability to recognize the 
central thought or main idea of an entire selection, a paragraph, 
or a specific part of a paragraph). ‘ 

7) Drawing Inferences and Conclusions (the ability to recognize, 
or to formulate, an idea which is not stated in a selection but which 
is dependent upon the combination of an idea (or ideas) which is 
(are) definitely stated in a selection and one which is outside the 
selection and within the informational or experiential background 
of the individual). 

In addition to these abilities, measures of reading rate and 
listening comprehension were also obtained. Reading rate scores 
were secured not only because comparisons between the groups 
were of some interest in themselves, but also because marked 
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differences in rate could be responsible for differences in compre- 
hension abilities, even though the tests were untimed. The 
measures of listening comprehension were obtained in order to 
determine whether differences in general comprehension ability 
exist when unencumbered by possible difficulties with mechanical 
skills in reading. 

Criterion Tests Used.—Eighty words from the Flashed Word 
Recognition and Word Analysis Test of the Durrell Analysis of 
Reading Difficulty, and twenty more difficult words from various 
forms (Q through T) of Part II (Vocabulary) of the Reading 
Comprehension Test, Advanced Battery, of the Jowa Tests of 
Basic Skills, were included in the Word Recognition Test. These 
one hundred words were arranged in lists of twenty each and were 
presented tachistoscopically for a duration of approximately one 
second. Only results obtained with the last four lists (eighty 
words) were included in the analysis of results. 

The Word Meaning Test was a multiple response test consisting 
of a representative sample of fifty items chosen from various 
forms (Q through T) of Part II (Vocabulary) of the Reading 
Comprehension Test, Elementary and Advanced Batteries, of the 
Iowa Tests of Basic Skills. 

A test of Comprehension Abilities was made up of nine reading 
selections and one hundred and thirty items which were adapted 
from the Reading Comprehension Test of the Elementary and 
Advanced Batteries of the Jowa Tests of Basic Skills, Forms L 
through T. Five subtest scores, each based on from twenty-five 
to twenty-seven items, were obtained as measures of these five 
specific abilities: Memory for Factual Details, Location or Recog- 
nition of Factual Details, Perception of Relationships among 
Definitely Stated Ideas, Recognition of Main Ideas, and Drawing 
Inferences and Conclusions. The reading selections were chosen 
on the basis of relative difficulty, apparent interest value, and the 
extent to which items accompanying selections represented the 
specific abilities named above. To obtain subtests of approxi- 
mately equal length and difficulty, a number of original items were 
eliminated and additional ones were constructed when necessary. 

A Listening Comprehension Test was constructed in a manner 
similar to that for the Test of Comprehension Abilities, except 
that items for Location or Recognition of Factual Details were not 
included. The test consisted of four reading selections and forty- 
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two items. The test booklets contained only the questions 
related to the selections. The selections and the test items were 
read aloud to the subjects by the examiner; and subjects marked 
their chosen responses to each question after it and its answer 
choices had been read. 

The Reading Rate Test consisted of a selection of approximately 
fifteen hundred words of fourth- to fifth-grade level of difficulty. 
Subjects were instructed to read the selection once ‘“‘at the same 
speed as you usually read.” They were told before they began 
that there would be questions over the material read. A short 
comprehension test, consisting mainly of items measuring memory 
for details, followed the reading of the selection. The rate score 
was a complement of the number of complete ten-second intervals 
which had elapsed during the reading of the selection. 

All of the criterion tests were administered by the writer during 
an eight-day testing period. All were administered as group 
tests, without time limits, with the exception of the Word Recog- 
nition Test, which was administered to each subject individually. 
All test items, except those for the Word Recognition Test, were 
of the four-choice multiple-response type. Except for the Reading 
Rate Test rate scores, all scores used in the analyses of results 
represented the number of items answered correctly. 

Analysis of Resulis—An analysis of variance design, ‘group-by- 
levels’, was employed in the analysis of results.* The bright and 
dull children represent the ‘groups,’ and the intervals of six 
months in the two-year mental age range used in the study con- 
stitute the ‘levels.’ A schematic presentation of the groups-by- 
levels design as it applies to this study is shown below. 











Criterion Scores 
MA Level MA (Es’d True) 
Bright Group Dull Group 
I 12-1 to 12-6 n=7 n=7 
IT 11-7 to 12-0 n=9 n=9 
III 11-1 to 11-6 n=7 n=7 
IV 10-7 to 11-0 n= 5 n= 5 














In the case of each ability investigated, the null hypothesis was 





8. F. Lindquist, Design and Analysis of Experiments in Psychology and 
Education. Boston: Houghton Mifflin Co., 1952. 
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TaBLE II.—Summary or Resutts: Mean Scores ror Eacu Grovp, 
DUFFERENCES BETWEEN MEANS, AND F-VALUES, FOR EACH 











ABILITY 
Bright Dull . 
Ability Ft F* 
Mean SD Mean SD 
Word Recognition 58.6 | 11.8 | 56.2 | 13.0 +2.4 | 0.499 
Word Meaning 30.4 | 6.1 | 30.8} 6.1 —0.4 | 0.076 
Memory for Factual De-| 17.4 | 4.0/15.1| 4.8 +2.3 | 3.678 
tails 


Location or Recognition | 15.8 | 3.6 | 12.7] 3.3 +3.1 | 10.126** 
of Factual Details 
Perception of Relation- | 15.4 | 3.6 | 14.0) 3.6 +1.4 | 2.058 
ships among Definitely 
Stated Ideas 

Recognitionof MainIdeas | 16.3 | 4.1 3 
Drawing Inferences and | 14.9; 3.1 /| 12.9) 3. 
Conclusions 
Total Comprehension | 79.8 | 15.1 | 68.4 | 16.7 | +11.4 | 7.922** 
Abilities 


+2.4 | 5.319** 
+2.0 | 4.628** 


Listening Comprehension | 30.6 | 3.6 | 23.6 | 4.7 +7.0 | 47.574** 
Reading Rate (rate score) | 61.6 | 17.9 | 55.4 | 15.9 +6.2 1.602 
Reading Rate (compre- | 10.7 | 2.4; 9.8/| 3.4 +0.9 | 1.444 


hension score) 























* For each F-value, df = 1, 48; Fos = 4.04. 
** Significant at 5 per cent level. 


tested, t.e., that the means of the populations of which the dull 
and bright groups were representative samples were the same. To 
test this hypothesis, the ratio of the mean square for groups to 
the mean square for within cells was employed. This ratio yields 
a value which is distributed as F, provided that the hypothesis is 
true and that certain conditions are met. A five per cent coeffi- 
cient of risk, selected in advance of the analyses, was employed in 
rejecting the null hypothesis. 


RESULTS 


A summary of the obtained results is presented in Table II. 
The table includes the mean and standard deviations for each 
group, the differences between the means, and the F-values ob- 
tained in the tests of the significance of the differences between 
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the means of the bright and dull groups. Positive differences favor 
the bright group. 

Obtained differences between the mean scores of the two groups 
favored the bright group with respect to all of the abilities except 
Word Meaning. In the case of this one exception, the difference 
was not only nonsignificant, but also less than one raw score unit. 
The analyses of variance yielded significant differences with 
respect to the following five abilities: Location or Recognition of 
Factual Details, Recognition of Main Ideas, Drawing Inferences 
and Conclusions, Total Comprehension Abilities, and Listening 
Comprehension. While not significant at the required level, 
differences between the mean scores for Memory for Factual 
Details and for Perception of Relationships among Definitely 
Stated Ideas were substantial, and further investigation with 
respect to these two abilities seems warranted. Differences with 
respect to Word Recognition, Word Meaning, and Reading Rate 
were not significant. 

The Test of Listening Comprehension was included in anticipa- 
tion of the possible event that the group which was found signifi- 
cantly poorer on most of the abilities would be found to be signifi- 
cantly better with respect to listening comprehension. Then such 
results might have been interpreted in terms of possible difficulty 
with mechanical skills in reading rather than in terms of differences 
in intellectual abilities or specific comprehension abilities. How- 
ever, the bright group, which excelled on nearly all of the abilities, 
also excelled significantly on the Listening Comprehension Test. 
Similarly, if a significantly higher rate had been found for the 
group also found to be significantly poorer with respect to most of 
the abilities, the poorer showing might have been attributed, in 
part, to tendencies to read carelessly and too hastily. However, 
no significant rate difference was found between the two groups 
and analysis of the reading rate comprehension check indicated 
that the two groups had read the reading rate selection with 
comparable degrees of understanding. These findings with respect 
to reading rate and listening comprehension tend to further in- 
dicate that superiority in reading comprehension involves supe- 
riority in intellectual functions rather than in the more mechani- 
cal skills. 

Tests for interaction between groups and levels were also made. 
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No significant interaction effects were found for any of the abilities 
tested, thus satisfying one of the necessary conditions or under- 
lying assumptions involved in the particular design of the study. 
Inspection of the frequency distributions for the various abilities 
involved in this study revealed that, in general, obtained scores 
did not closely approach the maximum possible at the upper end 
of the distribution or the ‘chance’ scores at the lower end. Thus, 
neither ceiling nor floor effects operated to bias the results. 


CONCLUSIONS 


In the strictest sense, the sample studied may be regarded as a 
representative sample only of hypothetical populations that show 
the same relative distribution of MA’s, IQ’s, and CA’s as the groups 
in the sample itself; and generalizations based upon obtained results 
should be restricted to these hypothetical populations. However, 
since no significant interaction effects between groups and levels 
were found, restrictions upon extending generalizations to real 
populations may be lifted to a considerable extent. Therefore, 
generalizing to a population of dull and bright children with 
widely differing IQ’s but approximately equal MA’s within the 
MA ranges found in this study and with reference to the various 
comprehension abilities as defined operationally, the following 
conclusions seem warranted: 

1) Bright children are significantly superior to dull children of 
comparable mental ages with respect to total reading comprehen- 
sion and the following specific abilities: locating or recognizing 
factual details, recognizing main ideas, and drawing inferences and 
conclusions. 

2) It seems probable that bright children are also superior to 
dull children of comparable mental ages with respect to memory 
for factual details and perception of relationships among definitely 
stated ideas. 

3) Bright children are superior to dull children of comparable 
mental ages with respect to listening comprehension. 

4) Reading rates of bright and dull children of comparable 
mental ages appear to be approximately the same when comparable 
degrees of understanding of material read are attained, with a 
wide range in rate being found in both groups. 

5) Bright and dull children tend to be alike with respect to 
ability in word recognition and word meaning. Bright children 
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are significantly superior to dull children of comparable mental 
ages with respect to the relatively more complex, and intellectual, 
comprehension abilities. 

6) It would seem that levels of expectation with respect to the 
more complex comprehension abilities should not be as high for 
dull children as for bright children of comparable mental ages. 
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GROUP PROCEDURES MODIFYING 
ATTITUDES OF PREJUDICE IN THE 
COLLEGE CLASSROOM: 


GERALD 8. WIEDER 


Brooklyn College 
Community Counseling Center 


Since recent research has indicated the intimate relationship 
between prejudice and certain patterns of emotional development 
and expression, it would appear logical that one approach to 
modifying prejudice might be through techniques which focus 
primarily on many of the same emotional factors. Thus, within 
the setting of the college classroom an experiment was designed 
which would compare the relative effectiveness of two methods of 
instruction in modifying attitudes associated with racial, religious 
and ethnic prejudice: one method utilized such group therapy 
procedures as nondirective methods of instruction and socio- 
drama, while the other employed the traditional lecture-discussion 
method. A secondary purpose of this research was to investigate 
the relationship between personality adjustment and the modifica- 
tion of the social attitudes with which this study is concerned. 
Although not directly within the scope of this investigation, the 
subsidiary problem of course content acquisition for the two 
classes taught by the investigator was examined. 

Recent research on the problem of minority-group prejudice 
has indicated the close relationship between certain socially- 
determined personality variables and prejudice. It has also been 
demonstrated that client-centered and sociodramatic therapeutic 
procedures can be adapted to classroom situations. (2, 3, 4) Con- 
sequently, it would appear logical that one approach in modifying 
prejudice might be through utilizing group techniques in a class- 
room situation which focus primarily on many of the same per- 
sonality variables. 





1 This article is a summary of a Doctoral dissertation completed at New 
York University. It was sponsored by Professor Charles E. Skinner, to 
whom the investigator wishes to express his deep appreciation for the in- 
terest and assistance provided. 
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The experimental design was organized in a manner which 
would permit integration of the research techniques into the 
fabric of the courses. Three instruments were employed; namely, 
the California Public Opinion Study E-F Scales, the Index of 
Adjustment and Values, and the Minnesota Multiphasic Inven- 
tory, both before and after the experimental period, to measure 
any changes. An _ objective-type course-content examination 
prepared by the investigator was administered to the two experi- 
mental classes at the midpoint and ending of the semester. The 
experimental classes were conducted by the investigator, who 
randomly selected one group to be taught by the utilization of 
group therapy procedures and the other to be conducted by the 
traditional lecture-discussion method. A third class, taught by 
another instructor using traditional methods of instruction, 
served as one control. Finally, another group which was composed 
of students registered in various courses within the same curricu- 
lum served as a time-factor control. All groups were compared on 
the significant variables and appeared to come from the same 
parent population. 


THE PROCEDURE 


The actual steps taken in the organization of this research are 
as follows: 

A) In September, approximately one week prior to the begin- 
ning of the school semester, the California E-F Attitude Scales, 
the Minnesota Multiphasic Personality Inventory, and the Otis 
Self-Administering Tests of Mental Ability were administered to 
a group of students (Control Group I) entering the Vocational 
Diploma Program at Brooklyn College. 

B) During the second class session of the two sections of Practi- 
cal Psychology D1 (the group therapy class and the traditional 
lecture-discussion class), given by the investigator, and of a third 
section of the same course given by another instructor (Control 
Group IT), the E-F Attitude Scales and the Minnesota Multiphasic 
Personality Inventory were administered. The Otis Self-Adminis- 
tering Tests of Mental Ability was part of the entrance testing 
battery of all Vocational Diploma Students desiring admission to 
the College. 

C) The investigator conducted one section by the traditional 
lecture-discussion method and the other using group-therapy 
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procedures. The codperating instructor conducted his section by 
the traditional lecture-discussion method. The fourth group was 
distributed among many classes, many of the students not regis- 
tered for Practical Psychology D1. 

D) During the seventh class meeting the Bills’ Index of Adjust- 
ment and Values was administered to the two classes conducted 
by the investigator. 

E) During the last session of the semester the four groups were 
retested with the original test instruments, including the Index of 
Adjustment and Values, with the exception of the Otis Self-Ad- 
ministering Test of Mental Ability. 

Before- and after-testing of attitudes and personality was 
conducted in order to measure any changes which might have 
taken place as a result of the teaching methods used. The purpose 
of the control groups was to estimate the effect of factors other 
than those of the teaching methods used; i.e., bias of instructor 
and the many uncontrollable variables attendant on just being 
registered in college for a similar period of time. Thus, there were 
two kinds of comparisons possible to evaluate the effects of the 
educational methods used. First, an intra-group comparison 
consisting of noting differences in the same group at the beginning 
and end of the semester; and, second, the inter-group comparison 
of noting the differences among the four groups studied. 

Since any conclusions based upon this investigation are depend- 
ent in good part on the experimental variables introduced, it 
would seem necessary to offer some description of the differing 
methods used in treating the subject matter of the course. - 


TRADITIONAL LECTURE DISCUSSION METHOD 


The procedure and plan of instruction followed with the lecture- 
discussion section is familiar to most college graduate and under- 
graduate classes. The instructor had rather carefully-planned 
lecture material which he presented to the class in a more or less 
systematically directed fashion towards the end of the covering of 
a given amount of pertinent factual material. Student participa- 
tion was usually limited to isolated questions and even more 
isolated comments regarding the factual material presented. A 
great deal of note-taking was commonplace, with the usual cram 
sessions for mid- and end-term examinations. The following are 
representative class materials presented: 
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Third Meeting.—Heredity-environment including genetic ap- 
proach, cultural-anthropological approach, history of controversy, 
studies, present thinking. 

Twelfth Meeting. —Husband-wife relationships including author- 
ity and responsibility, the career woman, economic factors, in- 
laws, physiological factors and divorce. 


GROUP THERAPY METHOD 


The type of play drama used with this class was a modification 
of the psychodramatic and sociodramatic sequences developed and 
used by Moreno (5) and Jennings (4). During the first class meet- 
ing the students were asked to list five everyday situations either 
at home or school that caused them embarrassment, created 
tension, or evoked displeasure, anger or unhappiness. The students 
were then asked to indicate whether they would care to appear in 
a little play built around this situation which would be presented 
to the group. 

Many of the dramas selected for class use were drawn from those 
presented by the students. The criterion for selection, of necessity, 
was their applicability to the material of the course. Where there 
were none which would adequately touch on course content, the 
investigator created one appropriate to the topic of discussion. 

Each play-drama was structured by the instructor, but the 
students were encouraged to select their own réles and play them 
out spontaneously, making up lines as they went along. The 
procedure usually followed was to permit the players to terminate 
their own scenes, with a half-hour being arbitrarily selected as a 
maximum time limit. Following the drama, the ‘audience’ was 
then invited to comment on any phase of the drama that to them 
invited comment. Concentrated effort was made to keep the two 
courses fairly similar in terms of subject matter treated and in the 
amount of time devoted to each area covered. Hence the subject 
of prejudice itself was confined to one class session in each class. 

Examples of specific play-dramas used in connection with the 
development of course content paralleling the examples selected 
from the traditionally conducted class are as follows: 

* Heredity-Environment.—A young couple who have been seeing 
each other for the past year decide that they are deeply in love 
and wish to marry. The fly in the ointment, however, is that the 
boy’s father is serving time in the federal penitentiary for a crimi- 
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nal offense. The young man’s conduct and behavior have always 
been examplary, but the young lady is aware of the strong objec- 
tions of her parents to this contemplated marriage. They never- 
theless decide to have the matter out with the girl’s parents that 
night. 

Husband-Wife Relationships.—a) The younger brother never 
seems to be able to get dressed early enough to get to school on 
time. Father, in a fit of anger at this recurrent situation begins to 
argue with mother. b) Mother, unable to cope with the children‘ 
asks father to do something about it. Father answers by declaring 
it’s not his responsibility. 


RESULTS 


Table I presents the mean attitude scale scores, variability and 
t ratios of the therapy groups at the beginning and end of the 
semester. There were three attitude scale variables on which the 
significance of changes were tested. These were the E, F, and E-F 
scales. The E-F score was the combined prejudice score as meas- 
ured by the entire attitude scale. 

Inspection of Table I reveals that differences obtained from 
pre- to post-testing were significant for the group therapy class. 
More specifically, changes occurring on the E scale might be 


TaBLE I.—Means, STANDARD DEVIATIONS AND COMPARISONS OF PRE- 
AND POsT-TESTS ON THE E,F anp E-F ScaLes For 
THE THERAPY GROUP 











Pre-test Post-test Level of 
Scale N M:-M2! F we Signific- 
M o M o — 
EK 27 65.4 | 13.04 | 60.9 | 15.1 | 4.22 | 1.33*| 2.06 .05 
F 27 | 122.1 | 23.36 | 105.4 | 22.7 | 15.92 | 1.06*) 5.02 01 
E-F 27 | 187.1 | 31.57 | 167.1 | 30.0 | 20.46 | 1.10*| 6.46 .O1 
































Nore: Using the small sample techniques and entering the ‘‘t’’ table 
with 26, 27, and 28 (as the case may be) degrees of freedom the ratio is 
significant at the .05 level if it is over 2.00 and at the .01 level if it is over 
2.65 (H. Garrett, Statistics in Psychology and Education’’, New York: 
Longman, Green and Co., 1947, p. 190). 

1 The difference between each pre- and post-test score was calculated 
for each person on each item by subtracting the latter from the former. 
Means and standard deviations of these differences were then compared. 

* F is not significant at the .05 level of confidence. 
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expected, due to chance, in only five out of one hundred cases: 
changes in the F scale in only one out of a hundred cases, and for 
the combined scale significance is also well established in that such 
changes as occurred would probably occur by chance in only one 
out of one hundred cases. Inspection of F results reveals no signifi- 
cant difference in variability. 

Such changes as did occur on the E, F and combined E-F scales 
for the lecture-discussion, Control Groups I and II were not statis- 
tically significant. With the exception of Control Group II, F and 
combined E-F scales, the direction of change was towards a re- 
duction of prejudice. Changes in variability were likewise not 
significant. 

Table II presents means, variability and t ratios on the Index 
for the experimental group at the beginning and end of the semes- 
ter. In the therapy group, Scale I, which is an expression of the 
individual self-concept, increased significantly in that the changes 
that occurred could be attributed to chance factors but five times 
out of one hundred. Scale II, which is an expression of the accept- 
ing or rejecting nature of the feelings the individual holds towards 
self, increased even more significantly in that the differences were 
significant at the .01 level of confidence. Scale III, representing 
the expression of the ideal self, was raised but not to any degree 
of statistical significance. Scale III-I which Bills has termed the 
discrepancy index and which relates to the discrepancy between 
the individual’s ideal self and his self-concept has been described 


TaBLe II.—Means, STANDARD DEVIATIONS AND COMPARISONS OF PRE- 
AND POST-TESTS ON THE INDEX OF SELF ADJUSTMENT 
FOR THE THERAPY GROUP 






































Pre-test Post-test Level of 

Scale N M:-M3! F ie Signific- 
M ‘ M . ance 

I 27 | 178.67) 14.38 | 184.61) 13.91) 6.09 | 1.06*| 2.11 .05 
II | 27 | 189.17) 21.84 | 202.00) 18.90) 13.33 | 1.33*| 3.76 .O1 
III 27 | 194.44) 16.35 | 196.46) 13.71] 3.72 | 1.42*| 1.48 | NS. 
III-I | 27 15.72, 15.10; 13.11 10.27/—1.18 | 2.16" 51) NS. 





‘The difference between each pre- and post-test score was calculated 
for each person on each item by subtracting the latter from the former. 
Means and standard deviations of these differences were then compared. 

* F is not significant at the .05 level of confidence. 
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as an index of maladjustment. Positive movement in psycho- 
therapy should be accompanied by reduction in the discrepancy 
index. Results for the group therapy class indicate this decrease 
in scale III-I but not to any degree of statistical significance. 

The data obtained for the lecture-discussion group reveal that 
such changes as were effected did not attain or approach statistical 
significance. Aithough the means of Scale I did increase some- 
what, means for Scales II and III decreased. The Discrepancy 
Index moved in the direction of better adjustment but did not 
approach significance. 

As in the lecture-discussion group the data for Control Group I 
indicate that changes effected were not statistically significant. 
Movement on Scales I, II and III were all in the direction of 
decreased means representing a decline in feelings of self-worth, 
self-acceptance, and of ideal self. Movement in the Discrepancy 
Index was in the direction of greater adjustment, but did not 
approach statistical significance. 

The data obtained for Control Group II reveal that with the 
exception of Scale III changes effected did not approach statistical 
significance. Movement on Scales I and II was in the direction of 
a decrease in self-worth and self-acceptance. Significant, however, 
at the .01 level of confidence is the movement of this group in the 
direction of a lowered ideal self. The discrepancy index was in the 
direction of greater adjustment but did not approach statistical 
significance. 

Table III presents the intra-group changes from the beginning 
to the end of the semester in the experimental group along with 
the ‘‘t’’ ratios testing the significance of these changes. Only two 
of the twelve variables in the group therapy class reached signifi- 
cance at the .05 level of confidence, Pd (Psychopathic deviate) 
and M (Masculinity-Femininity) at the .01 and .05 levels, respec- 
tively, although the Pa (Paranoia) and the K (Correction factor) 
did approach significance. Table III also shows that the direction 
of movement on the L (Lie), Hs (Hypochondriasis), D (Depres- 
sion), M (Masculinity-Femininity), Pt (Psychasthenia) and Sc 
(Schizophrenia) was toward decreased scores. Movement on the 
K (Correction factor), F (Validity scale), Hy (Hysteria), Pd 
(Psychopathic deviate), Pa (Paranoia) and Ma (Hypomania) 
was toward increased scores. 

In the lecture-discussion class four of the twelve variables 
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TaBLE II].—Means, STaNpDARD DEVIATIONS AND COMPARISONS OF PRE- 
AND Posi-TESTS ON THE M.M.P.I. ror ExpeRIMENTAL 
Controut Groups. (A) Group THERAPY 






































Pre-test Post-test Pol oe Level of 

Scale 2 Signific- 
ance 

M o M o M o 

L 54.95 | 6.58 | 54.5 5.89 —.25 5.89 21 NS. 
K 59.02 | 7.29 | 61.39 | 9.13 2.20 6.77 1.65 .20 
F 53.22 | 4.67 | 53.61 | 6.70 .35 5.99 .29 N.S. 
Hs 54.43 | 8.78 | 52.87 | 6.23 | —1.13 8.02 71 N.S. 
D 53.89 | 10.05 | 52.33 | 9.50 | —1.21 5.30 1.16 N.S. 
Hy 55.11 | 9.15 | 57.09 | 6.75 1.83 6.49 1.43 N.S. 
Pd 54.78 | 9.22 | 60.44 | 10.45 5.44 9.93 2.79 01 
M 52.94 | 6.36 | 49.54 | 8.19 3.45 7.20 2.41 .05 
Pa 49.31 | 6.51 | 53.00) 8.83 2.79 8.09 1.75 .10 
Pt 54.28 | 10.38 | 52.94 | 8.75 | —1.62 | 10.41 .79 N.S. 
Se 55.61 | 6.24 | 55.46 | 6.96 — .32 6.89 .23 NS. 
Ma 53.44 | 8.25 | 56.33 | 10.01 2.98 | 11.24 .14 N.S. 





Nore: Using the small sample techniques and entering the “‘t’’ table 
with 26, 27, and 28 (as the case may be) degrees of freedom the ratio is 
significant at the .05 level if it is over 2.00 and at the .01 level it it is over 
2.65. 

‘The difference between pre- and post-test scores was calculated for 
each person on each item by subtracting the latter from the former. Means 
and standard deviations of these differences were then compared. 


revealed significance at the .05 level of confidence. These were in 
K (Correction factor) .01 level, Hs (Hypochondriasis) .05 level, 
Hy (Hysteria) .02 level and Se (Schizophrenia) .05 level scales. 
The direction of movement of these was towards increased scores, 
as it was also on L (Lie), K (Correction factor), F (Validity scale), 
Pd (Psychopathic deviate) and Pa (Paranoia), though the latter 
were not significant. Decrease in mean scores occurred in the D 
(Depression), M (Masculinity-Femininity) and Ma (Hypomania), 
but these changes were likewise not statistically significant. 

Only one of the twelve variables in Gontrol Group I reached 
significance at the .05 level of confidence, the F (Validity scale). 
Movement towards a decrease in mean score occurred in this and 
also in the Hs (Hypochondriasis), D (Depression), Hy (Hysteria), 
M (Masculinity-Femininity), Pa (Paranoia), Pt (Psychasthenia), 
and Se (Schizophrenia) scales. Movement towards increased 
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scores occurred in the L (Lie), K (Correction factor), Pd (Psycho- 
pathic deviate), and Ma (Hypomania). 

None of the variables in Control Group II attained significance 
at the .05 level of confidence. Movement towards increased scores 
took place in the L (Lie), F (Validity scale), Hs (Hypochondriasis), 
Hy (Hysteria), M (Masculinity-Femininity), Pa (Paranoia), Pt 
(Pychasthenia), and Se (Schizophrenia) scales. Movement towards 
a decrease in score occurred in the K (Correction), D (Depression), 
Pd (Psychopathic deviate), and Ma (Hypomania) scales. 

In interpreting these data it should be noted that both pre- and 
post-testing scale means for all groups did not reach abnormality. 
Thus, while it is possible to discuss significant scale shifts in 
relation to pre-test scores, it must be stressed that there were 
actually no significant changes in total adjustment for any of the 
groups on any of the scales of the MMPI. The significance of 
scale shifts towards increased means can therefore be considered 
of extremely tentative significance with relation to the clinical 
entity each scale is designed to measure. Decreases in means can 
generally be disregarded in view of the small range of deviation. 

The Group Therapy Class had as many shifts in mean scale 
scores in the direction of maladjustment as it had in the direction 
of greater adjustment. The Pd (Psychopathic deviate) scale, 
which purports to measure a disregard of social mores, moved 
significantly in the direction of decreased adjustment, while the 
M (Masculinity-Femininity) scale moved significantly in the 
opposite direction. A possible interpretation of the Pd (Psycho- 
pathic deviate) scale shift is that the group responded to the 
therapeutic climate by an increased tendency to flout social mores 
or conventions. A plausible speculation, in view of the fact that 
the group mean did not approach borderline significance on the 
psychopathic deviate scale, is that the movement may be inter- 
preted as a shift away from conventional behavior and a decrease 
in the repressive aspects of super-ego motivation. The plausibility 
of this speculation is further supported by the F (Validity scale) 
results where it was seen that the group significantly declined in 
expression of rigidly-held conventional attitudes. Supportive, too, 
are the data with regard to the heightened self-concept and self- 
acceptance indices of this group. It is logical to assume that move- 
ment in this direction may be permissive of greater felt freedom 
to depart from the expected behavior pattern. As indicated above, 
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the decrease in F (Validity scale) means may, for all intents and 
purposes, be disregarded. 

Scale results for the Lecture-Discussion class present a rather 
perplexing picture. Movement in the direction of pathological 
behavior occurred on the K (Correction), Hs (Hypochondriasis), 
Hy (Hysteria) and Se (Schizophrenia) scales. Strictly interpreted, 
it would seem that this class responded to the lecture-discussion 
approach of the investigator by tending to move in the direction 
of greater defensiveness, somatic complaints, and withdrawal 
behavior. A highly tentative speculation concerns the dual réle 
played by the investigator. It is conjectured that the possible 
effects of this double identity may have contributed to the genera- 
tion of more than the usual amount of anxiety caused by classroom 
practice and that this was manifested by characteristic college 
student defenses such as increased defensiveness, somatic pre- 
occupation, and greater intellectualization. 

With the exception of a significant increase in the F (Validity 
scale) for Control Group I there was no significant movement for 
both controls. The high F (Validity scale) would suggest that the 
results for this group may be of questionable validity. 

Table IV presents the class means, deviations, and ranges for 
both groups taught by the investigator, e.g., group therapy and 
lecture-discussion, on both mid- and end-term objective examina- 
tions. Examination of Table IV clearly indicates that while the 
Lecture-Discussion group did slightly better on both mid- and 
end-term examinations these differences are statistically insignifi- 


TABLE IV.—A CoMPARISON OF CONTENT EXAMINATION RESULTS FOR 
Group THERAPY AND LECTURE-DIScUSSION CLASSES 

















| Mid-Term ~ | End-Term 
Class a a ie —e = 
| Mie | R | a | M g | R — 
Group Therapy | 81.51 | 5.51 | 70-90 | $0.97 | 7.74 | 70-91 
(33) 
18 .75 
Lecture-Discus- 81.76 | 4.50 | 68-96 82.40 | 6.47 | 70-93 
sion (25) 





























Nore: Using the small sample techniques and entering the ‘“‘t’”’ table 
with 56 degrees of freedom the ratio is significant at the .05 level if it is 
over 2.00 and at the .01 level if it is over 2.65. 
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cant. It is interesting to note that grade scores remained constant 
from the beginning to the end of the semester. 

The general conclusions drawn from a statistical analysis of the 
data of this study are as follows: 

1) The major purpose of the study was accomplished in the 
finding that a course in practical psychology conducted by using 
such group therapy procedures as student-centered classroom 
practice and sociodrama measurably modified attitudes associated 
with racial, religious, and ethnic prejudice. 

2) A course in practical psychology when conducted along 
traditional lecture-discussion methods of instruction did not 
significantly modify these same attitudes. 

3) The factor of attendance at Brooklyn College within the 
Vocational Diploma Program for the experimental period did not 
produce significant modification of these same social attitudes. 

4) Although statistically significant changes took place on both 
E and F scales for the experimental group only, statistical analysis 
of the data revealed that the attitude measured by the F scale 
(covert attitudes of prejudice) was modified more significantly. 

5) Inspection of the variable changes comprising both E and F 
scales revealed that the most significantly effected scale clusters 
were the conventionality and the authoritarian agression items. 
Seale variables approaching significance were the authoritarian- 
submission and the superstition and stereotype items. 

6) The data on the Index of Adjustment and Values were 
generally supportive of the attitude-scale findings. Significant 
increases in the Self-Concept (Scale I) and Acceptance of Self 
(Scale II) took place for the Group-Therapy class only. The dis- 
crepancy index (adjustment index III-I) while moving in the 
direction of greater adjustment did not approach significance. 

7) The Minnesota Multiphasic Inventory did not reflect readily 
understandable results. Significant movement took place within 
the Group Therapy class on the Pd (increased) and M (decreased) 
variables. The Lecture-Discussion class shifted significantly upward 


-on the K, Hs, Hy, and Se scales. Movement within the control 


groups was limited to a significant increase in the F variable for 
Control Group I. It appears that the MMPI when used with a 
population of ‘normal’ individuals may not be the ideal instrument 
to measure movement towards greater or lesser adjustment within 
a ‘normal’ frame of reference. 
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8) A subsidiary finding for the two groups taught by the in- 
vestigator was the practical similarity of course content test 
results on both mid- and end-term examinations. 


IMPLICATIONS 


The results of the present study have rather clearly demon- 
strated the relative superiority of therapeutic classroom techniques 
in modifying significantly inter-group prejudice. In doing so it has 
supported the findings of those investigators who believe prejudice 
to be but a symptom of a personality need syndrome, which is not 
readily amenable to modification by factual information, exhorta- 
tion, propaganda or other simple linear approaches. It further 
supports the view of such clinical educators, if such they may be 
called, as Rogers, Cantor, Snygg, Combs, Shedlin and Kilpatrick, 
to mention but a few, who insist that education is only truly 
effective in the degree that the individuality and integrity of the 
student is truly respected. This education is based on the assump- 
tion that the student can be trusted to learn in every way which 
will maintain and enhance self, and he can be trusted to grow, 
provided the atmosphere for growth is available to him. 

It has been indicated by this research that non-directive and 
sociodramatic techniques have contributed towards a total self- 
development that was reflected in increased self-insight, greater 
self-acceptance and a decrease in attitudes associated with racial, 
religious, and ethnic prejudice. Exemplary of much of this growth 
are several excerpts drawn from course evaluation forms submitted 
by students in the group therapy class: 

“The most important thing that I have gained is self-confidence 
and a feeling that I am not the only person who feels insecure, 
hesitant, doubtful and quite often frustrated. I have a fuller, more 
rounded outlook on life. I see my friends, relatives and teachers 
as human beings and friends rather than scientists with micro- 
scopes waiting to study me. I feel, that, now, we are all alike. I 
find that I am unconsciously more sympathetic toward people and 
their troubles, more friendly and can certainly hold a more intelli- 
gent conversation due to the knowledge I learned this term—and 
it was plenty.” 

“As a result of this course I think I have gained a broader 
outlook on the ‘whys’ and ‘hows’ of the behavior patterns of 
people around me. It has changed my approach to everyday 
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living because of the technique that was used, especially those of 
the dramas which were used and which depicted the same every- 


day situations most people have gone through.’’ 

“T have gained quite a bit from this course. I have learned that 
most people don’t do things just to be nasty but because of some- 
thing that bothers them. It has taught me to have patience with 
my friends and relatives when they are upset and annoyed. I have 
further learned to control my temper.” 


‘4 
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ATTITUDES OF PARENTS OF DIFFERING 
SOCIO-ECONOMIC STATUS TOWARD THE 
TEACHING PROFESSION’ 


WILLIAM F. ANDERSON, JR. 


University of Alabama 


The purpose of the present paper is to report on an investigation 
of the relationships between socio-economic status and parental 
attitudes toward the teaching profession. The following are 
reported here: (1) The prestige of teaching occupations; (2) The 
suitability of the teaching occupations for sons and daughters; 
(3) The desirability of the teaching occupations for sons and 
daughters. 

SUBJECTS AND METHODS 


Expressions of attitudes were obtained by asking the parents of 
a selected group of high school pupils to rank three teaching 
occupations (grade school, high school, and college) along with a 
selection of other occupations representing a wide socio-economic 
range. The rank assigned the teaching occupation is accepted as 
a measure of attitude. 

Measuring attitudes toward the teaching profession.—A question- 
naire was constructed to facilitate the investigation of parental 
attitudes toward the teaching profession. In constructing this 
questionnaire the first step was to select two lists of occupations, 
one for sons and the other for daughters which (1) were common 
to Iowa, (2) had a comparatively large number of workers, and 
(3) ranged from unskilled to professional. For the part of the 
questionnaire dealing with prestige, ninety-three occupations were 
used. For the parts dealing with suitability and desirability, 
forty-one occupations were used for sons and twenty-nine for 
daughters. 

Instructions to parents were in part as follows: (for prestige) 
‘Some occupations seem more important than others. Perhaps 
all of us respect some occupations, and the men and women in 
them, more than we do others. Place a figure 1 in front of the 





1 The article is based on the writer’s Doctorate dissertation, State Uni- 
versity of Iowa, 1952. Grateful acknowledgment is expressed to Professor 
James B. Stroud under whose direction the work was done. 
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occupations you have the highest respect for. ... Place a figure 5 
in front of the occupation you have very low respect for. You 
may assign the same rating to as many occupations as you like.” 

The following are the instructions employed for suitability: 

“In this part of the questionnaire we are asking you to think 
about each occupation and to judge how well it is suited to your 
son (or daughter). . . . In light of your son’s (or daughter’s) abilities 
and qualities which occupations is he (or she) best suited for? 
Please remember that he may be well suited to more than one 
occupation.” Place a figure (1) in front of the occupations which 
are very highly suited to your son’s (or daughter’s) abilities and 
qualities. ... Place a figure (5) in front of the occupations which 
are very poorly suited to your son’s (or daughter’s) abilities and 
qualities. 

Parents were asked to rate the same lists of occupations (for 
sons and daughters) as to desirability again on a five-point scale. 
Here parents were asked to make the ratings in terms of how well 
they would like to see their son or daughter enter them. 

Characteristics of the sample-—The questionnaire was adminis- 
tered to parents of certain of the six hundred and sixty-six tenth- 
grade students in the four public secondary schools of Cedar 
Rapids, Iowa, in the spring of 1952. 

By use of the occupation of the wage earning parent, which in 
most cases was the father, and the address, an estimate of the social 
status of each pupil was made. This amounted to a preliminary 
screening. Each pupil was placed in one of four lists based on 
this initial social class estimation: upper, upper-middle, ‘lower- 
middle, and lower. From each list a randomly drawn tentative 
sample was selected. To this sample the Index of Social Class 
was applied. When necessary adjustments were made, the 
sample appeared as follows: 























Son Daughter 
Total 
Father Mother Father Mother 

Upper-class 15 15 15 15 60 
Upper-middle 15 15 15 15 60 
Lower-middle 15 15 15 15 60 
Lower-class 15 15 15 15 60 
60 60 60 60 240 
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In ninety-five families both parents codéperated, while in fifty 
families only one parent filled out the questionnaire. 

Measuring the socio-economic status.—The method selected for 
determining socio-economic status was the Index of Status Char- 
acteristics, 1.8.C., developed by Warner, Meeker and Eells.? The 
formula employed for this estimation of social status is a weighted 
combination of values assigned to (a) Occupation, (b) Source of 
income, (c) House type and (d) Dwelling area. 

Each parent selected in the tentative sample was personally 
contacted by the writer. This contacting interview had as its 
aim the establishing of sufficient rapport to bring about acceptance 
of the questionnaire. During the interview, information needed 
to complete the I.S.C. was gained. Where needed, corrections 
were made as indicated above. A date was set for personally 
picking up the completed questionnaires. If upon returning to 
the parent the questionnaire was not completed a second date was 
set for pick-up purposes. 

Coéperation was furthered by the assistance of the superinten- 
dent of schools who wrote a letter of introduction which was 
shown to the parents by the interviewer. In these ways codpera- 
tion was obtained from ninety-two per cent of the families con- 
tacted. 


RESULTS 


Prestige of teaching occupations.—The mean ratings on prestige 
for the three teaching occupations are presented in Table I. To 
facilitate interpretation the mean ratings of several other occupa- 
tions varying widely in prestige are given. In both instances 
separate means are listed for the four social classes. The data 
are for sons only. . 

Noteworthy is the fact that ratings of occupations do not vary 
much with social class. This finding stood out in the entire list 
(ninety-three) of occupations, suggesting that the various occupa- 
tions are held in about the same degree of esteem by parents of 
the four social class levels. Mickey obtained a similar result 
when he asked ninth-grade students to rate occupations as to 
prestige.® 





2 W. L. Warner, M. Meeker and K. Eells, Social Class in America, Chi- 
cago: Science Research Associates, 1949, pp. 121-159. 

*E. C. Mickey, High School Student Rating of Occupational Prestige, 
Master’s thesis, State University of Iowa, 1948. 
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TABLE I.—MEAN PRESTIGE VALUES OF OCCUPATIONS FOR SONS BY 
SocraL CLAss OF RATERS 








Upper- Lower- 
Occupation ; Upper-class middle middie /(Lower-class 
Class Class 
Surgeon 
College President 
Lawyer 


College Teacher 
Supt. of Schools | 
High-school Teacher 
Newspaper Editor 
Veterinarian 
Factory Manager 
Rural School Teacher 
Policeman 

Bank Teller 

Lumber Dealer 
Carpenter 

Hardware Dealer 
Mail Carrier | 
Plumber | 

















NINBMDOHAONRPOCNHNNOOCHROMBHORRWHO 
PROWWNWNNNNNNNNNNNE NEE ee 
POMROHRONMAIBMWAMOCONDOUUWOSO 
PR OWNNNNNNNHNNNND BB eee ee 
OK AND PRONN RHEE DWDOONNAMaAWNSs 
WNNNNNNNNNE NE eee ee 
WOMAN NNMROCOwWMNDSDHOSCODDDSO 


Traveling Salesman | | | 3 
Sales Clerk | | | 3. 
Delivery Man | | | | 3. 
Waiter | | | 3. 
Bartender | | | 4 


| 





Also significant is the comparatively high prestige ratings 
accorded the teaching occupations by all social classes. This 
finding contrasts with the fact that important social class differ- 
ences were found when parents were asked to rate occupations on 
suitability and desirability. 

Suitability of teaching occupations for sons and daughters.—Do 
the views expressed by parents regarding the suitability of teach- 
ing for their sons and daughters vary with socio-economic status? 
The data in Table II show that the two upper classes rated all 
three of the teaching occupations as being more highly suited than 
did the two lower classes. The two lower classes rated teaching 
less well suited to their sons and daughters. In fact, except for 
lower class father’s attitudes (as expressed) toward grade-school 
teaching for daughters it can be said that parents of the two lower 
classes regard teaching as poorly suited to their children. Of 
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TABLE II.—MEANS AND STANDARD DEVIATIONS OF THE RATINGS OF THE 
SuITABILITY OF TEACHING OCCUPATIONS FOR SONS AND DAUGHTERS OF 
DIFFERING Socio-Economic Stratus 



































College High School Grade School 

Father Mother Father Mother Father Mother 
U?:8 M & (SD) | 2.7 (1.1) | 3.0 (1.4) | 2.3 (1.0) | 2.6 (1.2) | 2.7 (0.8) | 3.1 (1.2) 
U:D | M & (SD) | 2.7 (1.1) | 2.3 (1.2) | 2.3 (1.1) | 2.3 (1.2) | 2.7 (0.9) | 2.4 (1.0) 
UM:S8 Mé (SD) | 2.8 (1.4) | 2.9 (1.3) | 2.5 (1.2) | 2.5 (1.4) | 2.6 (1.3) | 2.9 (1.3) 
UM: D | M & (SD) | 3.3 (1.2) | 3.4 (1.2) | 2.9 (1.1) | 2.9 (0.9) | 2.5 (1.0) | 2.2 (0.9) 
LM:S | M & (SD) | 3.6 (1.3) | 3.8 (1.2) | 3.3 (1.5) | 3.1 (1.2) | 3.4 (1.5) | 3.1 (1.3) 
LM: D | M & (SD) | 3.5 (1.5) | 3.7 (1.4) | 3.1 (1.3) | 3.5 (1.4) | 2.7 (1.0) | 3.3 (1.3) 
L:8 | M & (SD) | 3.9 (1.2) | 3.9 (0.8) | 3.6 (1.5) | 3.8 (1.2) | 3.7 (1.5) | 3.9 (0.9) 
L: D | M & (SD) | 3.5 (1.0) | 3.6 (1.4) | 2.9 (1.1) | 3.3 (1.5) | 2.8 (1.1) | 3.2 (1.3) 

| 





U* Upper class parents. 

8 Sons, ete. 

Nore: Due to the instructions employed in the questionnaire the lower the mean the higher 
the attitude of suitability. 


special interest is the high rating given high-school teaching by 
the two upper classes, the second and fourth highest places among 
the forty-one occupations. In the two lower classes, high-school 
teaching was ranked nineteenth and thirty-first with grade-school 
and college teaching ranking even lower. 

Mothers of the several classes differed from fathers in that they 
judged all three teaching occupations a little less suitable for their 
sons—with the exception of the mothers of the lower-middle class. 

As a means of determining the statistical significance of social 
class differences in ratings, and differences among the ratings of 
the three occupations, analyses of variance (f test) were computed. 
Examination of these data on the fathers’ ratings revealed that 
the differences among the social classes concerning the suitability 
of the three teaching occupations for their sons were statistically 
significant at the five per cent level.t The analysis of variance 
showed that differences in the ratings of the three teaching occupa- 
tions were also significant. 

Mothers of the four social classes differed significantly (f test) 
in rating the suitability of the teaching occupations for their sons. 
This analysis also revealed that mothers did not judge the three 
teaching occupations as equally well suited for sons but rated 





‘ Hereafter, when significant difference is used it refers to a difference 
that is statistically significant at the .05 level or better. 
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high-school teaching as significantly better suited than either 
grade-school or college teaching. 

Parents with daughters judged the suitability of the three 
teaching occupations differently than did parents with sons. As 
can be seen in Table II, college teaching is strictly an upper class 
occupation; only this social class rated it as well suited for daugh- 
ters. A much different attitude exists about the suitability of 
grade-school teaching; fathers of all four social classes rated this 
teaching occupation above average while mothers of the moré 
favored environments rated it highly suitable and mothers of the 
two lower classes rated it slightly below average. 

With respect to high-school teaching, parents of the upper class 
rated this teaching occupation highly suitable for daughters while 
the upper-middle class rated it somewhat lower but still above 
average. Fathers of the two lower classes rated it average in 
suitability, while mothers judged it somewhat below average. 

Once again mothers differed from fathers in judging the suita- 
bility of the three teaching occupations. Ratings of mothers of 
the upper class were higher than those of fathers, but in the less 
favored environments, the opposite trend was noticed. 

The analysis of variance of the ratings of fathers with daughters 
revealed statistically significant differences in the way the three 
teaching occupations were rated, but non-significant social class 
differences. Analysis of variance of mothers’ ratings revealed 
both statistically significant social class and occupational differ- 
ences. 

Desirability of teaching.—Parents of different socio-economic 
status were asked to think less realistically and to judge just how 
much they would ‘like’ to see their son or daughter enter the 
occupations previously rated as to suitability. 

Means and standard deviations of ratings of the teaching occu- 
pations as to desirability are found in Table III. Since the 
differences between the ratings of mothers and fathers were so 
small, the attitudes of parents have been combined. 

The only difference that is significant among the ratings of the 
upper and lower social classes on desirability is that for the cate- 
gory—college teaching. Among the forty-one occupations rated 
on desirability for sons college-teaching was ranked the fifth and 
third most desirable by the upper and upper-middle classes, 
respectively. High-school teaching was rated ninth and tenth 
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TaBLE III.—MEANS AND STANDARD DEVIATIONS OF THE DESIRABILITY OF 
TEACHING OccUPATIONS FOR SONS AND DAUGHTERS OF DIFFERING 
Socio-Economic Status 














College High School Grade School 
Son |Daughter| Son Daughter) Son ‘Daughter 

Upper-class Mean 1.8 1.7 2.5 2.0 2.9 2.8 
SD 1.0 0.7 0.9 1.1 1.1 1.2 

Upper-middle Mean | 1.8 | 2.5 | 2.3 | 2.4 | 2.8 | 2.2 
SD 0.8 1.0 1.2 1.0 1.1 1.0 

Lower-middle Mean 2.4 2.7 2.8 2.9 3.2 2.7 
SD 1.5 1.5 1.4 1.4 1.4 1.3 

Lower-class Mean 3.1 2.7 2.8 2.2 3.1 2.3 
SD 1.5 1.3 1.4 1.3 1.4 1.0 


























Nore: The lower the mean the higher the attitude of desirability. 


most desirable by these same classes and was judged more desirable 
than such professional occupations as newspaper reporter, drafts- 
man, insurance agent and pharmacist. Although rated somewhat 
lower, grade-school teaching was judged more desirable by these 
two classes than all the skilled occupations. This was also true 
in the lower-middle class with the exception of electrician, carpen- 
ter and machinist, which were judged more desirable than grade- 
school teaching. The upper-class judged college-teaching to be 
the most desirable for daughters of all twenty-nine occupations 
rated, with high-school teaching judged more desirable than such 
professional occupations as office manager, registered nurse, social 
worker, stenographer, purchasing agent and newspaper reporter. 
The upper-middle class judged grade-school teaching to be the 
second most desirable for daughters of all occupations rated, with 
high-school and college teaching rated only slightly lower. The 
two lower classes rated teaching only slightly less desirable for 
daughters than did the favored classes, although they rated cer- 
tain business occupations (stenographer, typist and bookkeeper) 
as being slightly more desirable than teaching. Other business 
occupations were rated considerably less desirable than teaching. 

It is an interesting observation that although teachers tend to 
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come from the lower and lower-middle classes’ (probably making 
them about average proportionately with respect to social class), 
parents of these two classes, as suggested by this investigation, 
appear to regard teaching as a less suitable and less desirable 
occupation for their sons and daughters than do parents in the 
two upper classes. 


CONCLUSIONS 


1) In rating occupations as to prestige value the parents of this 
sample showed little difference one social class from another. 
Mean ratings for the four social classes were remarkably similar 
for the four classes. All three of the teaching occupations dealt 
with in this paper ranked relatively high on the prestige scale. 
Among the ninety-three occupations the composite rank value on 
a five-point scale for college teaching was 1.6 (1.1 being highest); 
for high-school teaching, 1.8; and for grade-school teaching, 1.9. 
(Eight occupations ranked higher than college teaching, in this 
order: surgeon, physician, college president, judge, clergyman, 
architect, dentist and electrical engineer.) 

2) Ratings on suitability of the teaching occupations varied 
with social class status. Parents of the upper two classes rated 
the teaching profession more suitable for their sons and daughters 
than did parents of the lower two classes. 

3) Ratings on desirability of teaching as an occupation also 
varied with social class status, although to a lesser degree, except 
for college teaching. 

4) Ratings on the suitability of the three teaching occupations 
varied as to the sex of the pupil. For all social classes combined, 
parents judged high-school teaching to be significantly more 
suitable for sons than the other two teaching occupations. The 
upper three classes rated college teaching as being more desirable 
for their sons. With the exception of the upper class, grade- 
school teaching was judged to be better suited to daughters than 
the other two teaching occupations. It was also rated as being 
much more desirable for daughters than for sons. 





5 Nathan A. Edwards, Sociology of Teaching. II: A Study of the Male 
Classroom Teacher, Doctor’s dissertation, State University of Iowa, 1952. 








¢ 
oA 











COMPARATIVE DECLINE OF GRADUATE 
RECORD EXAMINATION SCORES AND 
INTELLIGENCE WITH AGE 


R. TRAVIS OSBORNE and WILMA B. SANDERS 


University of Georgia 


From the literature (3, 4, 7, and 8) it is possible to infer that 
adult age increments are accompanied by a decline in acquired 
knowledge which would parallel the decline in intelligence with 
age. However, with the exception of Learned and Wood’s The 
Student and His Knowledge (3) and a previous study by the authors 
(4) there is almost a complete lack of direct data on the compara- 
tive decline of acquired knowledge and intelligence of adults. 
This investigation is for the purpose of securing research evidence 
relevant to this problem. 


POPULATION AND PROCEDURE 


In the present study the Graduate Record Examination was 
used for determining the student’s level of acquired knowledge. 
This test according to the G.R.E. prospectus “‘is the only examina- 
tion of national character which is expressly designed to measure 
achievement in the principal fields of collegiate study at the 
advanced level and which is endorsed or required by graduate and 
professional schools” (2). Actually the G.R.E. is a series of 
subject-matter tests designed to cover broadly the main areas of 
the typical undergraduate program of instruction: Mathematics, 
Physics, Chemistry, Biological Science, Social Science, Fine Arts, 
Literature, and Verbal Factor. ~The latter test is not a content 
test in the true sense of the word but it was developed and included 
in the G.R.E. battery to measure the ability to discriminate word 
meanings. 

The one thousand eight hundred and seven graduate students 
who were in attendance at the University of Georgia between 
February, 1946, and February, 1952, were subjects for study. 
This group includes all graduate students except some few irregular 
students who were not seeking an advanced degree. The seven 
professional schools and the five divisions of the College of Arts 
and Sciences were represented. Men students out-numbered 
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women: one thousand one hundred and seventy-three to six 
hundred and thirty-four. 

G.R.E. scores, personal data, and college marks were available 
from the files of the Dean of the Graduate School. Machine 
methods were used for processing and conventional statistical 
procedures were used in analyzing the data. 

Figures 1 and 2 compare graphically G.R.E. age-score and 
intelligence curves. The two sets of data were not obtained from 
the same subjects; the age-achievement curve was plotted from 
the results of G.R.E. and the intelligence curve from Wechsler’s 
normative data. For comparative purposes these curves have 
been reduced to approximately the same scale. 

The number, mean, and standard deviation for the seven profiles 
and three G.R.E. averages were determined for each four-year age 
increment from nineteen to fifty-five (Table I). Also, not shown 
but available on request there are tables of inter age-group confi- 
dence levels of mean differences for the seven achievement profiles. 
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Fig. 2.—Variations in G.R.E. Non-Science Profiles and Intelligence with 
Age. 


RESULTS 


Examination of Figure 1 shows that for the Sciences adult 
achievement loss with age is strikingly similar to the decline of 
mental ability with age. For Physics and Chemistry the age- 
score curves almost parallel the declining mental ability curve. 
Maximum performance in Chemistry is maintained slightly 
longer and the decline is somewhat more gradual than for Physics. 
Mean scores in Biology, however, fall off sharply after reaching 
a peak in the nineteen to twenty-two age level. The loss here 
is greater than the corresponding loss in intelligence with age. 
Among the sciences, mathematics scores seem to hold up best. 
Significant losses were not noted until after age thirty-eight. 
This loss is considerably less than the corresponding loss for these 
ages in intelligence. 

For Social Science there is no significant falling off of mean 
achievement scores with age. The greatest mean difference is 
only 3.6, and that is found between two adjacent age groups in 
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TABLE 8 a 


GRAQUATE RECORD EXAMINATION PROFILE TEST SCORES FOR 
UNIVERSITY OF GEORGIA GRAOUATE STUDENTS, BY AGE 
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PHYSICS CHEMISTRY BICLCGICAL SCIENCES MATKEMATICS 
AGE Mean MEAN MEaN MEAN 
Sroue NO. S¥. Sc.e $0. NOe ST. SCo SD NOe ST. SC.* Sede NO. Ste SCo® S.0, 
Eee ——————————— 
19-22 132 50.39 9.68 128 «51.40 10.87 132 54,54 99,14 9) 49.27 9,3! 
23~=26 413 51.86 9,19 400 52.05 10.31 413 53.26 9.14 339 51.10 10.18 
31-34 216 50.84 9.41 202 50.25 9.50 216 49,59 9.71 164 49,43 4 10,6) 
35238 205 49.52 10.00 175 47,84 9.07 204 46.92 9,60 153 Steed = A 
33=42 155 46.92 9.68 142 47,66 8.94 55 45.13 &,74 118 47,7) 9.62 
43-46 96 47.42 19.61 6&6 47.71 9,50 98 45.79 10.57 6&8 47.49 ti, 
47259 54 46.50 9.19 §3 48.C7 10.00 54 43,62 9.02 4) 47.40 10.3 
51=54 34 42.91 8.66 33. 44.07 8.32 34 40.39 9.25 25 49,95 St 
55 up 22. «41.59 9.4) 2) 44.21 5.65 22 «0644626 =: 100 00 14 36.64 12.4 
i 
; 
FINE ARTS Lt TERATURE SOCIAL SCIENCE eRe Es AVERAGES 
AcE MEAN MEAN MEA PHY, SO0CeSC. 
GROUP NO. ST. SC.e Sele NOo ST. 8c.¢ $eDe NO. Se SCee $206 G10. Lit. 

CHEM. FeAe Tota 
19=22 132 $2.96 40.65 132 = 2.82 9.73 132 50.55 9.74 $2.82 52.18 Si 
23-26 410 50.97 9,95 413 50.49 9.89 413 50.08 2,43 $2.39 5051 51.4 
2730 343 43,29 10. 352 48,82 {0.27 354 43.28 10.10 $1.39 49.13 50.3 
31-34 218 49.31 10.30 216 49.07 10.33 216 50.11 10026 §€.23 49.50 43.5 } 
25038 2020 449.21 9.99 204 49.66 10.05 203 $1.02 11.19 46.11 49.96 49,% 
39=42 152 48,93 9.55 155 50.38 9.12 155 49,67 9,74 4€.54 49,66 48.0 
47-50 $3 44,88 7.84 54 47.95 6.08 54 47,74 &.13 46.28 46.86 46,6 
51-54 33 $8.92 8.69 34 47.56 &.90 34 42.34 8,45 *42.14 48.27 45,7) 

55 ue 22 «52.43 9.40 22 «$4.27 $.89 22 50.55 9.59 42.32 52.40 46.56 








oUntversity OF Georcra worms (N#t7I6) 


the forties, suggesting the probability of sampling errors or factors 
other than significant age-score trends (Table I). The smoothed 
Social Science age-score curve does not follow the mental ability 
curve but forms a line almost parallel to the base through the 
group mean (Figure 2). 

Fine Arts and Literature means not only hold up with age but 
also some statistically significant mean differences favor the older 
student (Table I). Only from about twenty to twenty-five do 
these smoothed age-score curves parallel the intelligence curve 
(Figure 2). As was the case for Social Science, the Literature and 
Fine Arts age-score curves approximate lines parallel to the base 
through the group mean. There is almost an imperceptible 
tendency for these three tests to become elevated in the forty to 
forty-five age range, level off, and then rise again in the over 
fifty-five group. 
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DISCUSSION AND SUMMARY 


The primary purpose of this study was to compare the decline 
with age of acquired knowledge and intelligence. Previous in- 
vestigations have implied that there could be expected a gradual 
decline in achievement test scores after about age twenty-four to 
twenty-five, the age according to Wechsler at which ‘our intel- 
lectual capacities are definitely on the decline” (8). However, in 
so far as the present writers have been able to determine, no 
previous attempt has been made to compare the rates of decline 
in adult achievement and intelligence with age. Peterson did 
report in 1943 that “for each age there is a trend toward a lowered 
G.R.E. average score with increasing age” (6), but his findings 
were based on only thirty-two cases in the over twenty-nine 
age group and he made no attempt to compare intelligence and 
achievement decrements. By averaging G.R.E. profile scores 
the differential rates of decline were lost and the slope of the age- 
score curve was attenuated. 

There are several possible interpretations of the results presented 
above. For example, Peterson attributes the decline in G.R.E. 
scores with age to “the fact one loses considerable facility or 
becomes ‘rusty’ in a subject for which one has little or no oppor- 
tunity for practice while one loses little in a function which is 
constantly practiced” (6). On the other hand if the G.R.E. is 
just a long test of verbal intelligence (5) it would naturally be 
expected to show the characteristic decline of mental ability with 
age. 

Although research evidence is lacking on the point, Gilbert 
believes ‘“‘motivation plays an important part in intellectual 
decline with advancing age, for when motivation is absent, we find 
waning interest in things and a decline in intellectual activity 
and use of intellectual abilities. The result of this disuse, of 
course, is accelerated decline” (1). Still other explanations may 
occur to the reader; such as, for example, the inadequate sampling 
techniques inherent in cross-sectional studies. The fact that if 
the older age groups are found to be heavily weighted with women 
and/or Social Science, Literature, or Fine Arts majors would 
account for the sustained mean scores in these areas. 

The results presented in the previous section would appear to 
support the hypothesis of a differential rather than a consistent 
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rate of decline of acquired knowledge with age. Only the science 
profiles of the G.R.E. follow the characteristic curve of mental 
decline with age. Social Sciences, Fine Arts, and Literature 
scores seem to hold up well even at the extremes of the age range. 
As the rates of decline are largely functions of the nature and 
content of the tests used these findings should not necessarily be 
expected to hold for different tests, other age groups, or for any 
individual student. 
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THE SOPHOMORE GENERAL CULTURE 
TESTS AND STUDENT KNOWLEDGE 
OF CULTURAL FACTS 


BERT D. ANDERSON 


Southern State College, Magnolia, Ark. 


The problem of appraising the college student’s knowledge of 
cultural facts has received a good deal of attention in recent years. 
Special interest in this matter is justified since educators generally 
agree that American colleges have the responsibility of acquainting 
students with scientific and cultural information that is felt to be 
basic in perpetuating our society. Various programs of general 
education have been devised to improve the student’s knowledge 
of his heritage and the responsibilities he must accept if he is to 
become a worthy citizen in these critical times. In all of these 
programs it has been assumed that if the student possesses a 
certain body of general information he is more likely to become 
an effective citizen in our democracy. 

Perhaps the Sophomore General Culture Test in its various 
forms has been more widely used than any other single instrument 
to estimate the extent to which the college student has acquired a 
knowledge of cultural facts that are assumed to be significant in 
our society.' It is common practice to administer this test at the 
end of the sophomore year when the student has completed most 
of his general education training. 

The major research pertaining to this instrument has been 
conducted on a national basis of the Educational Testing Service 
through its national college testing program. Their findings have 
enabled individual colleges to compare the performance of their 
students on this test with that of a national college group. They 
have demonstrated that there is a wide variation in scores made 
within and between colleges on the Sophomore General Culture 
Test.” * 4» 5 However, for the local administrator of the general 





1 One hundred seventy colleges representing thirty-seven states partici- 
pated in the 1950 College Sophomore Testing Program. 

? Committee on Educational Testing, ‘‘Report of the sophomore testing 
program,’’ Educational Record, x1v, 1933, pp. 522-71. 

* The 1932 College Sophomore Testing Program, ‘‘A report by the Ad- 
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education program and the counseling staff these very practical 
questions arise: (1) How comprehensive is the coverage of this 
test when its items are related to course offerings found within the 
general education program? (2) How should the student’s per- 
formance on this test be interpreted when related to the instruc- 
tion he has received in the local general education program? 
(3) What do these scores tell us about the student’s knowledge of 
cultural facts? 

A partial answer to the above questions was found at the Uni* 
versity of Utah by (a) tabulating the Sophomore General Culture 
Test items according to the courses to which they were related 
that were offered within the general education curriculum and 
(b) ascertaining the relationship between scores received on the 
General Culture Test and the student’s past achievement in four 
separate areas of general education. 


SAMPLE 


In the spring of 1950 the Sophomore General Culture Test, Form 
Y, was administered to eight hundred and forty-nine students 
who were completing their sixth quarter of work at the University 
of Utah. They had finished most of their general education 
classwork requirements in the Physical Sciences, the Biological 
Sciences, the Humanities, and the Social Sciences. Two hundred 
students, whose scores and stated professional aims were found to 
be representative of the total group, were selected for more inten- 
sive study and used as the sample for this research. 


FINDINGS 


I) Test items coverage of curriculum content.—Students were 
required to complete at least nine credit hours in each of four 
general education areas. These were the Biological Sciences, the 
Physical Sciences, the Social Sciences and the Humanities. The 
number of courses made available to the student varied from ten 
in the Biological Sciences and the Physical Sciences to thirty- 





visory Committee on College Testing,’’ Educational Record, x111 1932, pp. 
290-343. 

4“The Report of the 1936 College Sophomore Testing Program,’’ Edu- 
cational Record, xv11, 1936, pp. 577-608. 

5 ‘Report on the 1950 National College Sophomore Testing Program,”’ 
Codperative Test Division, Educational Testing Service. pp. 1-4. 
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TasBLeE I.—NumBeER oF Test Items Devorep To Sussecr MATTER 
TAUGHT IN THE SocraL ScrENcCE AREA OF GENERAL EDUCATION 











General acai ot amd Name of] ame itea | So Bok Tet |e Pe 
Social Science Area 
Economics 6 1 7 
Geography 2 0 1 
History 11 1 48 
Health* 8 0 
Anthropology 1 0 0 
Political Science 7 19 14 
Sociology 3 25 0 
Psychology 2 0 4 
Philosophy 2 0 0 
Social Sc. Survey 3 UC. U.C.** 
37 54 74 














* Not offered in this general education area. 
** Unclassified. 


seven in the Social Science areas. The various courses in each of 
the four areas were listed separately so that the items of the 
appropriate Sophomore General Culture Test could be related to 
the course content being offered. Some of these findings are 
presented in Table I. 

In general, the test items found within the Current Social 
Problems and History and Social Studies Tests provided inade- 
quate coverage of course subject matter being offered in the 
Social Science area. While there were forty-nine History ques- 
tions contained in these two tests, it should also be noted that 
there were eleve:: separate History courses offered. There were 
no questions pertaining directly to Philosophy or Anthropology, 
yet these two courses were regularly being taken by students 
fulfilling the general education requirement in this area. 

In the other three areas, as was the case in the Social Sciences, 
course offerings were in many instances covered by too few items 
or none at all. Only ten courses were offered in the Biological 
Science area as compared to thirty-seven in the Social Sciences; 
however, only twenty-seven items of the Science Test were directly 
related to Biological Science courses while the remaining thirty- 
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three items were designed to cover subject matter taught in ten 
courses offered in the Physical Science area. In specific subject 
matter situations the lack of coverage was also noted. For 
instance, the Literature. and Fine Arts Tests, each containing 
ninety items, could best be applied to the courses offered in the 
Humanities area. A review of these items revealed that there 
were no questions asked in either test about such famous philoso- 
phers as Berkely, James, Bergson, Russell or Nietzsche, yet the 
Philosophy offerings within the Humanities area stressed tle 
works of these men. The General Culture Mathematics Test, 
while perhaps important from other standpoints, bore little or no 
direct relationship to subject matter offered in any of the four 
general education areas. 

II) Achievement in General Education related to scores received on 
the General Culture Test.—To discover the relationship between 
the student’s past achievement in courses completed in the Bio- 
logical Sciences, the Physical Sciences, the Humanities, and the 
Social Sciences, separate grade point averages were computed for 
each area and used as the criterion of achievement. These grade 
point averages were then correlated with the scores he had received 
on the General Culture subtests. The Pearson Product Moment 
correlation method was used to establish these relationships. 
The results are reported in Table IT. 

Each subtest, as well as the total scores of the Sophomore 
General Culture Test, correlated positively with achievement in all 


TaBLeE I].—TuHE CORRELATION COEFFICIENTS FOUND BETWEEN THE 
GRADE-POINT AVERAGES IN EACH OF THE GENERAL EDUCATION 
AREAS AND ScoRES RECEIVED ON THE SOPHOMORE GENERAL 
CULTURE SUBTESTS 





Current | Histo Fi Tot. 
General Education Area Social | Socia Lit. | Science png Math. | Gen. 
Prob. |Studies _— Cult. 





Biological Science Grade | .374 | .366 | .311 | .507 | .320 | .357 | .506 
Point Average 
Physical Science Grade | .211 | .315 | .288 | .526 | .147 | .429 | .422 
Point Average 
Social Science Grade Point | .344 | .333 | .374 | .356 | .289 | .254 | .444 


Average 
Humanities Grade Point | .146 | .204 | .282 | .209 | .262 | .076 | .281 


Average 
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four general education areas. With few exceptions, these rela- 
tionships were found to be significant at either the one or five per 
cent level of confidence. Only the Science Test and the Total 
General Culture scores correlated well with the grades earned in 
the Physical Sciences and the Biological Sciences. The total 
score was found to be the best indicator of the student’s past 
achievement in the Social Sciences while none of the General 
Culture subtest scores were closely related to achievement in the 
Humanities. The total score was the most consistent index of 
past achievement in general education even though there was a 
rather wide range in the size of these correlation coefficients. 


DISCUSSION 


The breadth of subject matter covered within the general 
education program presents an almost overwhelming problem 
when an attempt is made to measure student knowledge in this 
area. With so many courses being offered in each of the general 
education areas, it is only reasonable to suspect that many differ- 
ent course combinations will be presented by students in fulfilling 
the general education requirements. The difficulties involved in 
constructing an instrument that will adequately cover all of these 
subject matter areas becomes clearly recognized. With only a~ 
few items to cover several complete courses in some cases, perhaps 
it was not at all surprising to find low correlation coefficients 
between the grade-point averages earned in each of the four areas 
and the student’s test performance. 

The lack of relationship between test scores and achievement 
and the very limited coverage of test items leads one to suspect 
that the student’s General Culture scores will not necessarily 
indicate how effective the local general education instruction has 
been in improving the student’s knowledge of cultural facts. On 
the other hand, neither does the grade-point average earned in 
each of these four areas give a well-rounded picture of student 
knowledge of cultural facts, since it is based upon a limited per- 
formance in two or three courses in a broad subject matter area. 
For instance, the student might complete the Humanities require- 
ment without ever having completed a course in Literature. 
Perhaps it is at this point that the General Culture Literature 
Test can serve a useful purpose. If, in such a case, the counselor 
is willing to assume that the student’s performance on the Litera- 
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ture test represents a rather independent but objective index of 
the student’s knowledge of literature, then these scores can be 
used to supplement grades in appraising the student’s knowledge 
of cultural facts in the Humanities area. If these General Culture 
Test results are regarded more or less as an independent index of 
cultural knowledge then their validity must be defended upon the 
premise that the test is not only technically sound, but is based 
upon subject matter that is educationally significant. The lack 
of relationship between General Culture test scores and patt 
achievement in General Education courses does tend to support 
the hypothesis that these results do provide an independent 
measure of student knowledge in these general education areas. 
It seems quite clear that no single measure will give a well-rounded 
picture of the student’s knowledge of zultural facts. 


CONCLUSIONS 


1) The General Culture test does not provide adequate test 
item coverage when a number of elective courses are made availa- 
ble to students completing their general education requirements. 

2) The General Culture test results do not indicate how effective 
the local general education instruction has been in improving the 
student’s knowledge of cultural facts. 

3) The General Culture test results may be used as an inde- 
pendent index of the student’s knowledge of cultural facts if it can 
be assumed that the test is technically sound and based upon 
subject matter that is educationally significant. 
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FINAL EXAMINATION PERFORMANCE IN 
TRADITIONAL AND CROSS-DISCIPLINARY 
COURSES 


IRWIN A. BERG and RAYMOND J. MURPHY?! 


Northwestern University 


While the number of interdepartmental or cross-disciplinary 
courses appears to be increasing among American colleges and uni- 
versities (2, 3), the rise in popularity of such courses has been accom- 
panied by pointed questions as to their academic value when com- 
pared to the traditional, single-subject courses. For beginning 
college students, the traditional course is regarded by many as 
more ‘solid’ and as providing a firmer basis for mastery of the sub- 
ject matter, while the cross-disciplinary courses are thought to 
encourage superficial and popularized treatment of several fields. 
This attitude, understandably, has been indignantly challenged by 
proponents of inter-departmental course offerings. Curiously, noth- 
ing seems to have been published which offers factual data bearing 
upon the problem. 

In 1949, with the support of a grant from the Carnegie Founda- 
tion, a cross-disciplinary course known as An Introduction to the 
Sciences of Human Behavior was jointly offered by the Depart- 
ments of Anthropology, Psychology, and Sociology of Northwest- 
ern University. The course was not intended to be merely a survey 
of these fields. Rather, the aim was explicitly one of providing the 
students enrolled in the course with the same concepts, the same 
fund of information had they taken the three introductory courses 
in anthropology, psychology, and sociology. Thus the inter-disci- 
plinary course was intended to serve as a prerequisite for advanced 
work in any or all of the three fields in the same way as the tradi- 
tional courses. But more than that, by: bringing the fruits of scien- 
tific research and the explanatory concepts of the three disciplines 
to bear upon a general aspect of human behavior, it was expected 
that the students who completed the interdisciplinary course would 
achieve a broader and firmer grasp of the subject matter than stu- 





‘The authors wish to express their thanks to Professor Kimball Young 
for his critical reading of the manuscript. 
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dents who took the three traditional introductory courses sepa- 
rately. 

The present study attempts an answer to the question of whether 
students who learned their elementary psychology in the cross- 
disciplinary Sciences of Human Behavior have mastered the sub- 
ject matter as well as those who completed the regular or traditional 
beginning psychology courses. While the human behavior students 
were exclusively freshmen and the psychology content of their 
course was spread over three academic quarters, it was nevertheless 
believed that a reasonable comparison could be made of the two 
groups even though the traditional elementary psychology course 
students were sophomores and above and had their psychology in a 
one-quarter course. The basis for comparison was sixty multiple- 
choice test items which were administered to both the human be- 
havior and traditional psychology classes by including these items 
in the final examinations during the academic year 1951-52. 
Twenty items were administered in each of the fall, winter, and 
spring quarters to a total of three hundred and eighty-four tra- 
ditional psychology course students and to approximately three 
hundred human behavior students. Although the sixty test items 
sampled the entire range of subject matter for elementary psy- 
chology, only twenty items representing about one-third of the 
total material could be given each quarter. This division was neces- 
sary because the human behavior students have the psychology 
content of the course distributed over the entire academic year, 
while the traditional psychology students complete their course in 
one quarter. The total hours spent in class and the textbook assign- 
ments were the same for both groups. The test items were selected 
from the reading assignments required of both groups. Agreement 
among the human behavior and regular psychology instructors was 
used in choosing the items for inclusion in the final examinations. 
The test items which were identical for both groups were placed 
in a block at the beginning of each examination. The mean score 
for each examination was computed for both groups and for each 
academic quarter, and the significance of the mean score differ- 
ence tested by determining critical ratios. 

Because a variety of entrance tests are used by the University’s 
Office of Admissions, it was possible to secure scholastic aptitude 
test scores for only one hundred and eighteen human behavior 
students and forty-one traditional psychology students who had 
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taken the same form of Northwestern Analogies Test (1). A check 
of the general academic performance of these students for whom 
Analogies scores were available indicated that the groups could be 
regarded as roughly equivalent. Comparisons were also made of 
the identical item scores for students from the various colleges of 
the University. This was regarded as necessary because the human 
behavior course draws its enrollment chiefly from the College of 
Liberai Arts while the traditional psychology course includes stu- 
dents from virtually every school on the campus. Since the propor- 
tion of male to female students in both courses was about the same, 
no control over this factor was deemed necessary. 


DISCUSSION 


Viewed as a pattern for the year, the performance of the human 
behavior and regular psychology groups may be regarded as reason- 
ably comparable. As reference to Table 1 will show, the traditional 
psychology group earned a significantly higher mean score on the 
final examination items for the fall quarter; however, the human 
behavior group performance was slightly higher, though not signifi- 
cantly, on those items given to both groups for the winter and 
spring quarter examinations. 

Two factors probably account for much of the difference in the 
mean examination scores of the two groups for the fall quarter. 
First, the course plan of the human behavior group requires that 
a considerable portion of the fall quarter be devoted to an elabora- 
tion of the terms and concepts used in anthropology, sociology, and 
psychology. While this time is well spent, since it provides a solid 
foundation for the rest of the year’s work, it does mean that the 


TABLE 1.—FINAL EXAMINATION PERFORMANCE OF TRADITIONAL AND 
HumAN BEHAVIOR GROUPS 











Fall Quarter Winter Quarter Spring Quarter 
Course 
N* | M {|SD/}CR/ N*;} M{|SD/CR{N* | M/ SD [CR 
Traditional 150)13.3) 2.8 114/11.9) 3.2 120)13.9| 2.6 
3.9 9 5 



































Human Behavior | 300)12.2) 2.9 265)12.2) 2.5 a00ii4.1 2.6 








*The number of traditional psychology students represents different 
classes during each quarter. The human behavior group varies in number 
because of absences from the final examination. 
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cross-disciplinary students have a temporarily leaner academic fare 
for the psychology portion of the fall quarter when compared to 
the regular psychology course. The second and perhaps more im- 
portant factor is that of academic experience. Being younger and 
completely new to the college situation, the human behavior stu- 
dents should be most handicapped during the fall quarter when 
compared to the older and academically more experienced tradi- 
tional psychology students. This advantage should be reduced in 
the winter and spring quarters as the human behavior group im- 
proved in study skills, became familiar with multiple-choice tests, 
etc. Such was actually the case as shown in Table 1. 

Thus far our data indicate that the traditional psychology and 
the human behavior groups were, from an over-all standpoint, per- 
forming about equally on the sixty psychology examination items 
administered during the year. However, while total final examina- 
tion performance was approximately the same for both groups, it 
is emphasized that the interdisciplinary students were performing 
as freshmen about as well as the regular psychology students were 
performing as sophomores or above. While the two groups were 
quite comparable in over-all performance on the psychology test 
items, it cannot be argued that the human behavior students would 
necessarily have benefited even more had they been sophomores, 
as were the regular psychology students. True, the interdisciplinary 
students, though freshmen, demonstrated a level of mastery of 
psychology which was comparable to sophomores. However, it has 
not been shown that the regular psychology students would not 
have done just as well had they taken the course in the freshman, 
instead of the sophomore, year. Furthermore, although many of 
the weaker students leave college during the freshman year, it is 
possible that motivation for topflight performance may decrease 
among the more capable students in the sophomore year, once they 
have tested their academic mettle as freshmen and found it satis- 
factory. The freshmen in the human behavior course learned as 
much psychology as the sophomores in the traditional psychology 
course, according to the data presented; and that is about all that 
can be said in the light of available information. 

A question which naturally arises is whether one group was 
brighter than the other. Since scholastic aptitude test scores were 
available for only a limited number in both groups, no very positive 
statement can be made about the influence of this factor. As shown 
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TABLE 2.—NORTHWESTERN ANALOGIES TEsT SCORES FOR SPRING QUARTER 














1952 
a “ ” SD CR 
Traditional 41 87.1 an ¢ 
Human Behavior 118 91.3 22.2 sas 

















in Table 2, the evidence we have indicates that there was no sig- 
nificant difference between the two groups. The grade point aver- 
age for all courses of the forty-one regular psychology students 
was almost exactly at the mean of the grade point average for the 
total sophomore class. The mean Analogies test score for these 
forty-one students was 87.1, while the mean Analogies score for 
all students who took the test at the same time was 86.7. Thus it 
appears that the regular psychology group sample was probably a 
representative one in terms of performance on the test and for 
grades earned. The one hundred and eighteen interdisciplinary 
freshmen for whom Analogies scores were available earned a grade 
point average which was somewhat above that of the freshman 
class. The mean Analogies score for these one hundred and eighteen 
freshmen was 91.3, while a mean score of 86.4 was earned by all 
entering freshmen. It is suggested that these data strengthen the 
belief that the human behavior and traditional psychology students 
were comparable in scholastic aptitude since the slight, but non- 
significant, difference in Analogies test performance seems very 
likely to be due to a minor sampling bias. That is, the sample of 
human behavior students had a slightly higher mean scholastic 
aptitude score but they also earned slightly higher grades than the 
sample of the regular psychology students. This is what would be 
expected if the sample was slightly biased in favor of the human 
behavior group. 

As noted earlier, the regular psychology course draws its enroll- 
ment from nearly every college of the University while the inter- 
disciplinary course draws chiefly from the College of Liberal Arts. 
If the students from one college were notably stronger or weaker 
academically, the mean final examination scores might be influ- 
enced by such a deviant group. Tables 3, 4, and 5 present data 
bearing upon this point. While the mean psychology final exami- 
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TABLE 3.—FINAL EXAMINATION PERFORMANCE OF TRADITIONAL 
PsycHoLocy Group ANALYZED By SCHOOLS FOR SPRING 
QUARTER 1952 











School — Per cent M SD 
Commerce 43 35.8 13.7 2.3 
Liberal Arts 20 16.7 14.7 2.5 
Engineering 18 15.0 13.2 2.9 
Music 11 9.2 15.5 3.4 
Education 10 8.3 13.3 2.3 
Speech 9 7.5 13.5 1.6 
Journalism 9 7.5 14.8 2.2 

Total 120 100.0 13.9 2.6 

















TABLE 4.—CRITICAL Ratio MatTrRIx OF FINAL EXAMINATION MEAN Score 
DIFFERENCES AMONG SCHOOLS REPRESENTED IN TRADITIONAL 
PsycHOLOGY FOR SPRING QUARTER 1952 








School Liberal (Ragineer! sgusic | Pave | speech | Joameh 
Commerce 1.4 8 1.9 5 3 1.3 
Liberal Arts 1.6 2 1.3 1.2 P| 
Engineering 1.6 a 3 1.5 
Music 1.6 1.6 5 
Education 2 1.4 
Speech 9 























TABLE 5.—FINAL EXAMINATION MEAN Score DIFFERENCES By SCHOOLS 
REPRESENTED IN THE HUMAN BEHAVIOR COURSE FOR SPRING QUARTER 























1952 
School N Per cent M SD 
Liberal Arts 192 73.8 14.0 2.6 
Education 48 18.5 14.3 2.8 
Other 20 7.7 14.1 2.9 
Total 260 100.0 14.1 2.6 





nation scores varied somewhat for students from the different col- 
leges, none of these differences were of statistical significance at 
even the five per cent level of confidence. Thus there is no evidence 
to indicate that differential performance on the psychology final 
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examination items had any significant influence on the size of the 
total group means. 

In conclusion, it appears reasonable to state that our evidence 
shows that the freshmen in the interdisciplinary course learned as 
much psychology as the sophomores and above in the traditional 
course. Whether the interdisciplinary students would have demon- 
strated even greater achievement had they been sophomores, or 
whether the traditional psychology students would have performed 
as well or better had they taken elementary psychology as fresh- 
men, is not known. No evidence is available which bears on this 
point. 


SUMMARY 


A study was made of the final examination performance of three 
hundred and eighty-four students in the regular or traditional psy- 
chology course and approximately three hundred students in the 
interdisciplinary human behavior course which is offered jointly 
by the departments of anthropology, psychology, and sociology 
at Northwestern University. Sixty test items which sampled the 
range of material covered in elementary psychology were adminis- 
tered to both groups as part of the final examinations during 1951- 
52. Though composed entirely of freshmen, the interdisciplinary 
students in over-all performance did about as well as the traditional 
course students who were sophomores and above. No evidence was 
found that the two groups differed significantly in scholastic apti- 
tude. 
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BOOK REVIEWS 


HELEN M. WALKER AND JOSEPH Lev, Statistical Inference. New 
York: Henry Holt and Co., 1953. 510 p. $6.25. 


At long last we have a textbook in statistics for students of 
education which is non-mathematical, comprehensive, accurate, 
and up-to-date. Though the title of the Walker and Lev book 
indicates that it may be of interest to researchers in many differ- 
ent fields, the senior author is well-known as a professor of educa- 
tion and virtually all the materia! in it is written in the context 
of problems in educational research. 

Although it may not be considered an elementary text, since by 
conventional standards it includes ‘advanced topics’ in statistics, 
it features the intuitive approach to the development of statistical 
concepts, an approach designed for the student not prepared to 
read mathematical literature. The first nine chapters are develop- 
mental and designed to cover the basic concepts of statistical 
inference. They include essential ideas of probability, population 
and sample, sampling distributions, chi-square, inferences con- 
cerning the mean and variance, and analysis of variance. The 
remaining chapters depend upon the theory introduced in the 
first part of the book in applications of statistical theory to such 
problems as correlation and regression, measurement and relia- 
bility, analysis of variance and co-variance. Included is an 
excellent final chapter on the very practical subject of non-para- 
metric methods. An example of much needed modern content 
not hitherto available in books for students in education is a 
treatment of sample survey design. 

In the first three chapters, through the medium of simple nu- 
merical problems, Walker and Lev ‘intuitively’ introduce the 
student to most of the statistical concepts he needs to understand 
statistical inference. This includes an introduction to probability 
and probability distribution (including standard symbolism for 
expressing probability statements), the notion of parameter versus 
statistic, the binomial distribution, probability density, mathe- 
matical expectation, the normal probability distribution, random- 
ness in sampling, statistical estimation, the power function, and 
sampling from a finite population. Later chapters are unusually 
thorough, accurate, and complete on the subjects they cover. 
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The usefulness of the publication is enhanced by generally good 
illustrations and examples throughout, and an exceptionally 
comprehensive set of statistical tables and charts in the appendix. 
Good references are included for students who might be interested 
in further pursuit of topics. 

Legitimate comments on this book must be so overwhelmingly 
in its favor that criticisms of it seem trivial. The writer of this 
review found the gains in the adoption of this text in an advanced 
course in educational statistics to outweigh by far the difficulties 
encountered with it. For a first edition there are a negligible 
number of typographical and numerical errors not covered in the 
errata sheet supplied by the publisher. For instance, a bar is left 
off of an X in equation (9.25), and what should be 77 appears as 
73 in an equation at the bottom of page 214. These are of minor 
consequence in the use of this text. The writer found but one 
example which seemed to distract students from the statistical 
objectives. This appeared in Chapter 9 in an otherwise excellent 
development of the model for analysis of variance. It dealt with 
archery scores, the conceptualization of which seemed about as 
difficult for students to grasp as the concepts they were used to 
illustrate. This is an exception, however, since in general exam- 
ples are realistic and well chosen. 

The unique order of presentation of materials covered in this 
book may present some difficulties. This will depend upon how 
it is used. The introductory chapters emphasize models, theory, 
and concepts, and progressively and adequately build up a work- 
ing background. However, the writer finds that inexperienced 
students find this somewhat tedious. Even in advanced courses 
education students appear to need an occasional opportunity to 
examine the application of a principle in a number of different 
ways. In one sense, it is logical to initiate such things as the 
power concept early in a course in statistical inference. From 
the standpoint of efficiency of instruction, however, this may 
profitably be delayed until after experience has been offered in 
some of the operations of statistical inference which does not 
require it. Related concepts of efficiency and consistency of 
estimates appear early in the text but are not clinched except in a 
supplementary chapter dealing with percentiles (or ‘inefficient’ 
statistics). Such matters, of course, are in the category of choices 
which usually are expected to be faced by an instructor as he 
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adapts to his class and his own objectives. The writer found it 
necessary, for instance, to supplement and emphasize equation 
(9.9) on the partitioning of sums of squares in a simple analysis of 
variance problem, an important basic concept appearing to need 
proof and elaboration. It is only fair to say, however, that there 
are undoubtedly many alternatives in arranging the sequence of a 
course in statistics and that it is too much to expect a text on this 
level to be self-teaching. 

The authors in their preface suggest that some students might 
study the text without previous courses in statistics but that those 
without facility and quantitative thinking might profit by a 
“slow-paced introductory course’. In most teachers colleges and 
colleges of education this text will probably be most useful in a 
second and/or a third semester course. As an advanced text it 
should be widely accepted. It will be found to be an efficient 
vehicle to the understanding of modern statistical methods in 
educational research. The material is there and the exposition is 
precise and clear. Francis G. CoRNELL 


University of Illinois 


WILLIAM STEPHENSON. The Study of Behavior. Q-Technique and 
its Methodology. Chicago: University of Chicago Press, 1954, 
pp. 376. $7.50. 


This book does not, as the title implies, deal with behaviour, at 
least, not in the sense that the term is usually understood by 
psychologists; it deals essentially with techniques of obtaining 
self-ratings and ratings of others, called Q-sorts by Stephenson, 
and with techniques of correlating persons and factor analyzing the 
resulting tables of correlations, called Q-techniques by Stephenson. 
It constitutes, therefore, a summary and detailed statement of 
work carried on intermittently by Stephenson since 1935, and well 
known in its main aspects to statistically-minded psychologists. 
Those who have followed this work, and the many controversies 
to which it has given rise, will turn to this book in anticipation of 
finding a detailed discussion of the main points, together with an 
empirical proof of the effectiveness of the methods suggested. 

Of the two innovations which Stephenson claims to have intro- 
duced, it may be said that the Q-sort presents a somewhat novel 








Book Reviews 375 


and, under certain circumstances, a probably useful technique of 
self-rating. The subject is given a large number of statements 
similar to those appearing in orthodox questionnaires; he is then 
asked to rank these in order of applicability to himself, putting 
the one that characterizes him best at the top, and the one least 
like him at the bottom. The relationship between a Q-sort and 
an ordinary ‘Yes’-‘No’ type of questionnaire appears to be essen- 
tially the same as that between a product-moment correlation and 
a tetrachoric correlation; though both do pretty much the same 
sort of thing, under certain circumstances the one may be prefera- 
ble to the other. Stephenson never discusses in detail why he 
considers a Q-sort to be preferable to an ordinary questionnaire, 
and this reviewer, at least, found it difficult to see why claims 
made for the Q-sort should not be applied in equal measure to the 
despised questionnaire. On the evidence supplied by Stephenson, 
then, we can only conclude that the Q-sort, while ingenious, is 
unlikely to support the elaborate super-structure erected on it by 
Stephenson. 

With regard to the technique of correlating persons, we find 
ourselves in the midst of a very stormy controversy. Most 
orthodox psychologists have pointed out that the techniques used 
by Stephenson are not new, that many of the detailed applications 
are subject to damaging statistical criticism, and that the results 
in cases where the method can legitimately be applied are no 
different from those which would have been reached by the more 
orthodox technique of correlating tests or questionnaire items. A 
good deal of Stephenson’s book is taken up with attempts to 
answer these criticisms, and the reader is liable to get lost in these 
somewhat heated exchanges. By and large, this reviewer has 
come to the conclusion, after examining the evidence carefully, 
that the critics are right and that anyone wishing to use this 
technique would be well advised to discuss the methodological 
principles involved with someone expert in this field. 

Stephenson does not help his case by showing a somewhat 
disingenuous tendency to change the meaning of the term Q- 
technique over the years, whilst pretending that what he now 
means by it is what he has meant by it all along. Simple exami- 
nation of his earlier papers and the examples quoted therein is suffi- 
cient to show that his views have changed considerably. There 
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is no harm, of course, in changing one’s views, but it does make 
controversy exceedingly difficult to follow when, quite justifiable, 
criticisms of earlier work are rebutted by referring to later and 
quite different definitions and interpretations. 

Stephenson has, from the beginning, illustrated his discussions 
by means of small-scale experiments, not, presumably, meant to 
be taken very seriously and serving only illustrative purposes. In 
the present book, one might have hoped for a somewhat longer, 
detailed, and worked-out example of his technique, demonstrating 
precisely how it could be useful in advancing psychological theory 
and knowledge. Stephenson has not taken this opportunity, 
however, but has relied again on a large number of very small and 
often exceedingly trivial examples. It is difficult to judge a 
method in the absence of even a single instance where it has been 
applied to its best advantage. However, on its present showing 
it seems difficult to take very seriously the somewhat extravagant 
claims which Stephenson makes for it. The reviewer is willing to 
reverse his judgment if a convincing example could be given to 
demonstrate the usefulness of Q-technique. In view of the failure 
over a period of twenty years to provide such an example, however, 
it seems unlikely that this judgment will be reversed. 

One last feature of the book calls for comment; namely, its 
style. A great deal of the argument is difficult to follow because 
of Stephenson’s somewhat explosive and disjointed manner of 
writing. There is little in the way of sustained argument, but a 
constant switching of discussion from statistical detail to philo- 
sophical speculation, to attack on critics, to illustrative example, 
and back again to statistical detail. Distrusting his own reaction 
to this, the reviewer handed the book to several statisticians who 
confirmed his opinion that, from the statistical point of view at 
least, it was impossible to find any concise, consecutive, and 
properly documented demonstration of Stephenson’s main claims. 

In summary, there is much that is ingenious, interesting, and 
suggestive in this book, but it cannot be recommended to anyone 
not thoroughly familiar with the field and able to discount the 
more unorthodox and less acceptable parts of the argument. 

H. J. Eys—enck 


Maudsley Hospital, London 
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Drug Addiction among Adolescents. Sponsored by the Committee 
on Public Health Relations of the New York Academy of 
Medicine with the Assistance of The Josiah Macy, Jr. Foun- 
dation. New York. The Blakiston Co., 1953, pp. 320. 


There was widespread alarm concerning the growth in the use 
of drug addiction in adolescents in the early fifties. In New York a 
number of committees were formed to better combat this evil. One 
of them was sponsored by the Welfare Council of New York City. 
A committee of fifty-eight decided to call a conference for the pur- 
pose of obtaining the thinking of authorities in medical, psychiatric, 
sociologic, educational, and law-enforcement fields. This was done 
with the help of the Josiah Macy, Jr. Foundation. This volume re- 
ports two sets of such conferences: a one-day conference held at 
the New York Academy of Medicine on November 30, 1951, re- 
ported in the first section of the book, The subcommittee held the 
second conference which lasted for two days without any prepared 
papers except a listing of subjects for discussion to be presented to 
the group. This was held at the New York Academy of Medicine 
on March 13 and 14, 1952, and the supposedly spontaneous discus- 
sion was recorded and is presented in the second section of the 
book. 

One of the conclusions that came out of the first conference was 
a more alive awareness on the part of the participants that drug 
addiction among adolescents is primarily a sociological, psychologi- 
cal, and economic problem and only, secondarily, a medical one. 
There were some participants who emphasized the abnormal psy- 
chology or medical slant. But, generally, a realization that only a 
multiprofessional approach was at all promising seemed to hold 
sway. Almost every phase of the problem was considered; not in any 
thorough going fashion, however. Some of the problems concerned 
with were the need for elimination of concepts of moral depravity 
that doctors still have; the comparisons based on reminiscences of 
some of the participants with factors which were realized thirty 
years ago; the reason for the increase in adolescents’ use of drugs; 
how the craving could be prevented; how the supply can be 
stopped; how to rehabilitate those who have been addicted; the 
role of education; what can be done in an institution; and what 
can be done in other directions including the education of parents. 
All of these and other related problems are considered in these 
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conferences. One of the direct results of these two conferences was 
the inauguration of a project to correlate and integrate data on 
adolescent drug addiction gathered by the magistrates’ courts of 
the city of New York. With the assistance of the United States 
Public Health Service, the project is now being carried on in New 
York University. 

A small agenda committee might have prepared the material so 
that more alive and present-day efforts would be considered from 
the point of view of action for research and practice. But the Maty 
Foundation deserves congratulation on arranging for this confer- 
ence and in helping in the publication of the materials of it. People 
in the field will not find this hard reading; they will find it segmented 
and piece-meal, however. Short comprehensive summaries would 
have helped clarify the presentation for the reader. The nearest 
thing to a feature of this kind shows up in the early portion of the 
second conference where Dr. E. H. L. Corwin is invited to sum- 
marize the first conference. For orientation purposes the confer- 
ences evidentally served. The appendix contains a glossary of terms 
used from drug addiction—argot, this is called—and also an in- 
dex. In a short but enlightened introduction by Dr. Frank Fre- 
mont-Smith, Medical Director of the Josiah Macy, Jr. Foundation, 
the purpose and significance of these conferences is expressed more 
coherently than in the other contributions. ‘‘What has developed,” 
says he, “‘is a much clearer recognition that drug addiction in ado- 
lescents must be seen, not as a problem of moral degeneracy, not 
merely as a failure of law enforcement, but rather as one symptom 
of the serious deprivations suffered by many children living in 
large and crowded cities.”’ H. MELTZER 

Psychological Service Center 

St. Louis, Missouri 


M. E. Bennett. College and Life. New York: McGraw-Hill Book 
Co., 1952, pp. 457. $4.50. 


The author says that College and Life ‘‘deals with the various 
problems of learning and living in college, which each student must 
face and solve for himself as a democratic citizen in his college 
community in order to achieve the real values of a college educa- 
tion.” Again, ‘“‘Vocational guidance and mental hygiene form the 
warp and woof of the materials throughout the book...” 
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There are three parts to the book. In the first of these, entitled 
“Living in College,” the general college scene is discussed from 
various points of view. The student is asked to consider seriously 
whether or not he is one of those who really should be in college. 
What the student may expect to get out of college in return for the 
time and money invested is discussed. Such aspects of college 
living as extra-curricular activities, customs and traditions of one’s 
college, time budgeting, kinds of majors, trends in college curricula, 
importance of scholarship, participation in school politics as an 
effective college citizen, and learning self-direction come under 
consideration. The aim is stressed of achieving effective local, 
national, and international citizenship. Bennett says ‘“‘College ex- 
perience should provide every student with some excellent training 
in how to choose a leader, how to follow a leader, and how to be 
a leader.’’ These are sound aims, too often neglected in the thinking 
of persons responsible for the training of students of all ages, and 
woefully absent in the thinking of the students themselves. 

In Part Two, ‘‘Learning in College,’ four chapters are given to 
the nature and conditions of learning and retention, and to effective 
study habits. Chapter 8, the discussion of conditions for effective 
learning, would need implementation by way of reading or class 
discussion. The usefulness of the study-conditions and health in- 
ventories provided in Chapter 9 is limited by the student’s probable 
lack of adequate criteria for answering. The exercises for improving 
memory in Chapter 10 may also lose in effectiveness from the stu- 
dent’s inability to evaluate his faults and to outline changes in his 
study practices. The remaining five chapters of this section touch 
upon use of the library, effective silent reading, note-taking, think- 
ing with a purpose, and the appraisal of progress in college. Gener- 
ally, these chapters contain much of value to the beginning student. 
However, such exercises as the evaluation of reference materials 
call for knowledge and experience not possessed by many fresh- 
men. The usual freshman would also need a great deal of help with 
the reading inventory, inasmuch as he simply would be unable to 
make some of the judgments required. The chapter on purposeful 
thinking seems quite good, and includes exercises which should be 
of value. The chapter on appraisal of progress should help to reduce 
the student’s fear of examinations through understanding of their 
uses, and give him techniques for doing well in them, other things 
being equal. 
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Part Three, “Building a Life,’’aims at the development of per- 
sonality, achieving mental health, developing a life philosophy, 
and achieving satisfactory adjustments in family life and in demo- 
cratic citizenship. As is the case with the other sections, there is a 
lot here of value for the beginning student, but much of it is pre- 
sented in rather a shallow manner, and the exercises for the student 
call for more experience and knowledge than he is likely to have. 
It would seem that the demands made upon the instructor, if the 
material is to benefit the student very much, are great. , 

It appears to this reviewer that throughout the book the exercises 
and indicated discussions would call for a counselor considerably 
better-trained than the usual ‘mere’ instructor. The somewhat 
elementary level of presentation of much of the material may be 
justified in view of its intended use with beginning college students, 
but adequate use of the material to bring about good adjustment 
and recognition and acceptance of the stated goals on the part of 
the students would require the services of a teacher well-trained 
in clinical psychology or counseling. If such a person were available, 
this book should provide a basis for an excellent orientation course. 
The 405-item bibliography contains excellent and recent references 
for students at all ability levels. Ceci M. FREEBURNE 

Bowling Green (Ohio) State University 


Leopotp BELLAK, Ed. Psychology of Physical Illness. Psychiatry 
Applied to Medicine, Surgery, and the Specialities. New York: 
Grune and Stratton, 1952, pp. 243. 


This is a very little book on a very big subject. Perhaps the 
magnitude of the field justifies the two titles carried by this work, 
but most certainly its contents do not. The subtitle is accurate, 
but the primary title is highly misleading. Of psychology, as psy- 
chologists know it, this book contains nothing. The last statement 
must be tempered only by recognition that the one chapter which 
appears to show some semblance of effort at scholarly review of 
existing literature (Chapter 5, on Psychiatric Problems in General 
Surgery) has one reference to a paper co-authored by a psychologist 
and dealing with a scientific, psychological study of determinants 
of post-surgery recovery. This paper receives no treatment in the 
text but is simply given super-script reference together with three 
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other papers which, like the great bulk of the references used by 
the various authors, are clinical accounts of and superficial hypothe- 
ses about psychiatric phenomena associated with illness and physi- 
cal defect. 

This is a book by M.D’s for M.D.’s. More specifically, it is a 
collection of brief papers by psychoanalytically trained or oriented 
medical specialists, physicians who obviously consider analytic con- 
cepts as both valid and efficient as a basis for 1) investigating, 2) 
understanding, and 3) controlling the doctor-patient relationship. 
In these papers, emphasis is upon understanding of the medical 
patient’s psychological problems primarily within the framework 
of the abstractly descriptive terminology of psychoanalysis. Sci- 
entific investigation (as contrasted with clinical accounting) and 
recommendations as to techniques of management are generally 
given short shrift in preference to relatively stereotyped accounts 
of the ‘meaning to the patient’ of his particular symptom or de- 
fect. Much use is made of the ‘body image’ concept with little 
effort to outline its developmental psychological basis. 

Of the possible and probable purposes which a book carrying the 
titles of this one might have, it is difficult to find that any one has 
been pursued. It is very obviously not a comprehensive review of 
psychological data or information concerning physical illness or 
handicap in the sense of Pintner, et al., The Psychology of the Physi- 
cally Handicapped, or Barker, et al., Adjustment to Physical Handi- 
cap and Illness. As a matter of fact, the goal and effort represented 
in outcome in these two works is considered by the editor of this 
book as ‘nearly impossible’. This book is obviously not a report of 
medical and psychological research carried on by a team of investi- 
gators concerned with any particular sample of patients, as exem- 
plified by Keys, et al., Biology of Human Starvation. It is obviously 
not an attempt to review the different theories and systematic 
formulations of outstanding psychological and medical experts con- 
cerning psychiatric implications of illness. 

What of the stated purpose of the book? According to the editor, 
it is offered ‘‘with the hope that we may contribute to a better medi- 
cal practice—one which is more enjoyable for the doctor and more 
beneficial to the patient.’’ The exact way in which this contribution 
is to be achieved is not stated explicitly. Certainly it is clear from 
the content that the contribution is not to be in the form of factual 
information (e.g., frequency of significant psychiatric complications 
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in surgical procedures). Nor is the contribution to be in terms of 
reasonably detailed recommendations concerning the management 
of the emotionally involved patient (e.g., possible techniques for 
effecting a psychiatric referral of a patient with multiple sclerosis). 
Nor is this book’s contribution to a ‘better medical practice’ to be 
in the form of challenging questions and the mapping of unexplored 
areas. This book raises no questions and in no way suggests that 
there are any areas of ‘psychology’ in medical practice that require 
objective investigation. So far as may be determined from this col- 
lection of little essays, the ‘psychology of physical illness’ is that 
contradiction in terms—a finished science. 

What of the actual content? This is an edited collection of fifteen 
papers by sixteen authors. The medical areas covered include gen- 
eral practice, internal medicine, malignancy, general surgery, plas- 
tic surgery, gynecology and obstetrics, urology, orthopedics, eye, 
ear, nose and throat, neurology, pediatrics, dermatology, dentistry. 
The final chapter is on the ‘Personality of the Physician as a 
Factor.” In view of the pervasive psychoanalytic orientation of the 
book, one can well imagine the editor’s frustration at having to 
dispense with a chapter on proctology because he was not ac- 
quainted with any expert in the rectal field who was “at the same 
time competent in psychiatry.” 

With the exception of the editor, the writers are not nationally 
prominent in either psychology or psychiatry. This may account 
for the brief biographical sketch which precedes each chapter and 
helps to establish for the reader the special competence of the par- 
ticular writer. The biographical similarity of the writers is most 
striking. All sixteen are residents of New York City. With the ex- 
ception of two private practitioners, the remaining fourteen con- 
tributors have formal affiliation with one or another of five promi- 
nent New York medical institutions. Half of them are associated 
with either Mount Sinai Hospital or Columbia University. Seven 
of the authors are formally affiliated with the New York Psycho- 
analytic Society. Only half of the authors record any previous pub- 
lications in the references appended to their chapters. The chapter 
reference lists which range from six to fifty-eight items in length, 
and average nineteen items, are somewhat expansively termed 
‘bibliographies’. 

The general impression left with this reader by both the chapter 
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contents and the biographical data on the writers is that their 
primary qualifications consisted of convenient geographical ac- 
cessibility to the editor and a happy homogeneity of philosophical 
(i.e., psychoanalytic) conviction. The collaboration and editorial 
direction have resulted in a work of surprisingly even quality in 
contrast to the very marked variation of style, organization, em- 
phasis, and thoroughness which is a common criticism of similar 
compendiums. Unfortunately, the desirable evenness of quality is 
achieved by a failure on the part of the participants to raise them- 
selves above the platitudinous level of observations that the sick 
person has ‘feelings’ about his sickness and that it is important for 
the physician to recognize and deal with these feelings. The chapter 
on dentistry, for example, is lush with unqualified statements about 
how the “usual patient feels about his mouth,” about tooth decay, 
and about dental therapeutics, but there is no indication of even 
an awareness on the part of the author of the extensive research 
literature on psychological determinants of individual differences in 
pain reaction. ‘Pain’ has but a single reference in the index of this 
book and the term ‘limen’ appears not at all. The one chapter which 
stands above all the others because of its unhackneyed observa- 
tions, the subtlety of its insights into the implications of established 
facts of psychology, and its emphasis upon the importance of the 
patient’s information and control of a medical procedure as an 
anxiety alleviant, is the chapter on general surgery. Typical of this 
writer is his recommendation that operating room personnel should 
“be aware of the fact that during the induction of general anesthe- 
sia, hearing is the last sense category to be obliterated and that 
profound impressions can come by way of the partly narcotized 
hearing apparatus.” 

Books whose authors and publishers assign titles which neces- 
sarily suggest appropriate comparison with other similarly titled 
works, or whose titles arouse natural expectations as to content, 
must bear the test of such comparisons and expectations. A com- 
parison of this little book with a big title against such works as 
those of Pintner and Barker indicates differences both of goal and 
achievement of the magnitude of the differences between a Sunday 
picnic and the French expedition to the top of Annapurna. In view 
of the strongly critical flavor of this review and the current revival 
of ‘book-burning’ one might ask if this is a dangerous or harmful 
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book. It is not. It may be perused safely by medical students, gen- 
eral practitioners, and specialists—even by the physically-ill per- 
son. It is unlikely to add to the knowledge or understanding of 
any of these, but it most definitely will not harm them. It is not 
that good. WILLIAM SCHOFIELD 


University of Minnesota 








